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(54) TlUe: A SEXTANT CORE BIOPSY PREDICTIVE MECHANISM FOR NON-ORGAN CONHNED DISEASE STATUS 
(57) Abstract 

A method for screening individuals at risk for the loss of organ confinement in prostate cancer is disclosed. The method is useful for 
evaluating cells from patients at risk for recurrence of prostate cancer following surgery for prostate cancer. Specifically, the method uses 
specific Maikovian nuclear texmre factors, alone or in combination with other biomarken. to determine whether the cancer will progress 
or lose organ confinement. In addition, methods of predicting the development of fatal metastatic disease by statistical analysis of selected 
biomarkers is also disclosed. The invention also contemplates a method that uses a neural network to analyze and interpret cell morphology 
data. Utilizing Markovian factors and other biomarkers as parameters, the network is first trained with a set of cell data from known 
progressors and known non-progressors. The trained lietwork is then used to predict the loss of organ confinement by evaluating patient 
samples. 
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FOR NON-ORGAN CONFINED DISEASE STATUS 
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RACKGROUND OF THE INVENTION 
The present application is a continuation-in-part of co-pending U.S. Patent 
Application Serial No. 08/315,210, filed September 29, 1994. The entire text of the above- 
referenced disclosure is specifically incorporated by reference herein without disclaimer. 

5 

1. Field of the Invention 

The present invention relates to fields of computer-assisted quantitative image analysis 
and methods to classify cells related to cancer progression. More specifically, it concerns 
methods of multivariate statistical analysis as applied to prediction of organ-confined disease 
10 status based upon the sextant biopsy pathology, PSA and quantitative image analysis. Also 
set forth is a method of predicting non-organ-confined disease status in patients based upon 
results of tests performed prior to election of any treatment or following such treatment. 

2. Description of the Related Art 

15 Prostate cancer is diagnosed in 100/100,000 white males and in 70.1/100.000 black 

males in the United States. It is the second leading cause of male cancer deaths and the most 
commonly diagnosed cancer in men in the United States representing 21 % of all newly 
diagnosed cancers. In 1993 an estimated 165,000 men in the United States were diagnosed 
with clinically apparent prostate cancer and 35,000 will succumb to the disease. The age- 

20 specific increase in incidence achieves a maximum of 1000/100,000 in men >75 years of 
age. The lifetime risk of developing clinical prostate cancer in the U.S. is 8.7% for white 
and 9.4% for black Americans with a lifetime risk of dying being 2.6% and 4.3% 
respectively. The risk of developing prostate cancer has risen 42.6% since 1975 as 
compared to an increase of only 26% in risk of developing lung cancer for that same time 

25 period. Approximately 65% of prostate cancers are clinically localized at the time of 

diagnosis and potentially curable with standard surgical techniques, yet only 50% of men are 
found to have disease confined to the prosutc at the time of surgery. Pack and Spitz (Pack 
R. and Spitz M.A. The Cancer Bulletin, 45:384-388, 1993), reviewing the epidemiology of 
prostate cancer, indicated several definable risk factors such as age, race, dietary fat 



-2- 



wo 97/12247 PCT/US96/I5780 

consumption, vasectomy, and familial aggregation with at least a two-fold increased risk for 
first generation relatives of men with prostate cancer (rare autosomal dominant inheritance). 
These causa] correlations, though impressive, can not yet explain the complex etiology, 
biologic heterogeneity, and rapidly increasing incidence of this disease, and await further 
5 investigations of genetic, epigenetic and environmental factors. 

The monality rate for prostate cancer has been steadily increasing over the past 40 
years and will continue to do so as our population ages. This clinically evident disease 
represents only the tip of the iceberg in that nearly 30 percent of all men over age 50 harbor 
10 a silent microscopic form of latent prostate cancer. Current early detection methods are 
increasing the numbers of this latent form of cancer identified, which now represent more 
than 1 1 million cases within the male population in the United States, and growth rate studies 
indicate that these mmors appear to grow very slowly and the great majority should remain 
clinically silent. 

15 

Recent advancements in transrectal ultrasonography and the development of a serum 
based assay (prostate specific antigen, PSA) for early detection has caused the diagnosis of 
premalignant neoplasias as well as prostate cancer to increase at an alarming rate. Many of 
these newly diagnosed neoplasias could represent the non-aggressive, potentially latent form 

20 of the disease that may never have become clinically evident if followed without therapy. 
Unfortunately, no accurate and specific methods presently exist to distinguish the more 
potentially aggressive form of prostate cancer from the latent form of the disease; thus most 
patients diagnosed are presently treated as though they had the aggressive form of the 
disease. At present, the factors to be considered in assessing cancer progression are 

25 estimates and significance of tumor volume, pre- and post-operative histological grading of 
cancer and high grade intraepithelial neoplasia, clinical and pathological , stage, and serum 
prostate specific antigen (PSA) to predict biological aggressiveness of prostate cancer. These 
techniques generally have only marginal predictive value. 
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It is well accepted that the epigenetic and genetic transformation of a normal prostatic 
epithelial cell to a cancer cell with progression to a metastatic phenoiype requires multiple 
steps. The development of methods to quantify accurately these changes in order to better 
predict tumor aggressiveness has been the subject of much experimental work in prostate 
cancer. The use of chromatin texture feamre data extracted from either H&E or Feulgen 
stained sections correlate well to classification of malignant cells. However, the sensitivit}' 
of Markovian texture measurements is complicated by. the level of pixel gray level resolution 
(grain). Dawson et al. used a CAS- 100 Image Analysis System and software lo measure 22 
Markovian texture feamres at 20 levels of pixel resolution (grain) and found ten features that 
discriminated chromatin patterns in breast cancer images captured by the CAS- 100. 
Markovian analysis is a method based on determining gray-level transition probabilities and it 
allows discrimination among different nuclear texture features; the value for each feature 
depending on the level of grain resolution for each measurement. 

Christen et aL applied a linear discriminant statistical model analysis of shape, size 
and texture feamres of H&E stained prostate nuclei to a high efficiency, 93% correct 
classification of normal and abnormal cells. Also, Irinopoulou et aL employed Feulgen 
stained nuclei and a computer-assisted image analysis system to characterize digitized images 
(512 X 512 pixels, with 256 possible gray tone levels) from twenty-three patients with Stage 
B carcinoma of the prostate followed for at least three years. Using five chromatin texture 
features and discriminant analysis methodology, these patients could be divided into those 
with a good and poor prognosis. 

In spite of the progress made in predicting the organ confinement of prostate cancer 
cells, it is evident that improvements are needed in the accuracy of such determinations. A 
particular advantage would be realized by the development of methods that provide for 
accurate and reproducible statistical analysis of prognostic variables to maximize the 
aggregate positive predictive value while simultaneously reducing false negatives and false 
positives. 
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SIIMMARY OF imTENTION 
The present invention provides new and improved methods for determining the 
potential for prostate cancers to spread beyond the confines of the organ. The methods 
involve assessing treatable, localized prostate cancers using core biopsies, fine needle 
5 aspirates in order to: (1) better evaluate which patients have tumors that need any treatment, 
(2) determine the prognosis of patients with prostate cancer pathologically localized to the 
glands after surgery, so that adjuvant treatment of those patients with a high probability of 
disease progression might begin earlier in the natural course of the disease, and finally, (3) 
provide more objective means to select patients for chemoprevention trials using dietar>' 
10 modifications, retinoids, and hormonal manipulation (i.e. 5-alpha reductase inhibitors). 

For the purposes of this invention, progression is defined as recurrence of disease 
post-treatment (surgery or irradiation), for example in the case of prostate cancer, as 
determined by PSA elevation, clinical evidence of local or regional tumor recurrence, distant 
15 metastasis, or death. Organ confined disease status is defined as prostate cancer that is still 
contained within the prostate gland and has not invaded the prostatic capsule. 

The invention provides a series of pre-treatment methods applied to sextant core 
biopsies to determine the stams of disease confinement to the prostate. In certain 

20 embodiments, a statistically analyzed combination that includes quantitative nuclear image 
features selected from pathologically important tissue sections and appropriately selected 
biomarkers provides an aggregate positive predictive value with negligible false negatives and 
false positives that exceeds current conventional pathological methods for predicting non- 
organ confinement, or tumor extension beyond the prostate gland. Tht aggregate positive 

25 predictive power of the model, all parameters (i.e. quantitative nuclear image features and 
biomarkers), was achieved using clinically and pathologically well defined patient samples. 
The ultimate goal of this invention is to apply these predictive capabilities to prostate biopsies 
for determination of non-organ confined disease status. 
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This invention provides a method to collect nuclear images and extract all relevant 
nuclear morphometric descriptors (NMD's), including size, shape, texture (Markovian 
analysis), and DNA content features. Additionally, other biomarkers were used in 
combination with nuclear morphometric descriptors. Select biomarkers within the scope of 
5 the instant invention include, but are not limited to. PSA values (both total PSA and the free 
PSA to total PSA ratio), the number of positive sextant core biopsies, tumor location 
(whether base, mid, or apex involvement), the sum percent area of tumor involvement, 
Gleason score, DNA ploidy, and RT-PCR analysis of prostate marker mRNA. The NMD's 
combined with the biomarkers can then be analyzed to construct a non-parametric, non-linear 

10 mathematical model (e.g. Applying statistical methods such as logistic regression; 

discriminate analysis (Bayesian classifier or Fischer analysis); recursive partitioning methods 
(Classification and Regression Trees, or CART); or neural networks (both standard and 
proprietary)), that can yield a single predictive probability for prostate cancer organ 
confinement, with or without conventional pathological grading. The pathologically 

IS significant areas are identified by an expert trained in the identification of abnormal cells and 
tissue architecture associated with malignancies of the prostate. Such abnormalities may be 
present in core biopsies, or fine needle aspirates that have been fixed using methods that 
preserve the antigenicity of the biomolecules of diagnostic significance, cellular architecture, 
and integrity of the deoxyribonucleic acid (DNA) or chromatin. 

20 

According to the present invention, a method of predicting organ confined disease 
stams is provided, comprising the steps of first obtaining a clinical sample from a subject, 
then analyzing cell nuclei from areas selected by pathology expens and collecting the NMD's 
as well as phenotypic cellular biomarker information, and thirdly, predicting organ 
25 confinement or nonconfinement status using non-parametric statistical analysis of the data or 
the use of trained neural networks. Cell sampling for image analysis involves the selection 
of intact cell nuclei representative of the worst state of differentiation as well as, when 
present, well to moderately differentiated cancer cells. This provides a measure of tumor 
heterogeneity often present in prostate cancer. It is suggested thai at least 50% of tlie cells 
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analyzed be of the worst state of cellular differentiation present in the clinical sample, and 
that the remainder of the cancer cells analyzed represent the well to moderately differentiated 
cell population, if present, in the clinical sample. 

5 It is an object of the present invention that the more accurate staging of prostate 

cancer may spare many patients enormous morbidity and high costs of ineffectual traditional 
therapies for localized disease such as in radical prostatectomy that are inappropriately 
applied to non-organ confined disease due to inaccurate staging. Alternatively, more recent 
attempts at pre-treatment downstaging therapy may be monitored using the procedures set 

10 forth in the instant invention in combination with other repeat sextant biopsies. It is a further 
object of the invention that in addition to these protocols being used as a tool for monitoring 
patients during the course of their disease, the invention also provides more accurate disease 
staging combined with current conventional technology, which will lead to more appropriate 
early selection of patient management options, including, but not limited to surgery, radiation . 

15 therapy, chemotherapy, and gene therapy. 

In certain embodiments, the resulting data includes biomarker results (e.g. including 
but not limited to PSA values, the number of positive sextant core biopsies, tumor location 
(whether base, mid, or apex involvement), the sum percent area of nimor involvement, 

20 Gleason score, DNA ploidy. and RT-PCR analysis of prostate marker mRNA) and NMD's. 

calculated based on nuclear size, shape, and DNA content, as well as texture features derived 
by nearest neighbor relational analysis of individual pixel gray levels to mathematically 
determine several features (e.g. including, but not limited to. object sum, optical density, 
object size, object shape, picograms of DNA, angular second moment, contrast, correlation 

25 difference moment, inverse difference moment, sum average, sum variance, sum entropy, 
entropy, difference variance, difference entropy, information measure A, information 
measure B, maximal correlation coefficient, coefficient of variation, peak transition 
probability, diagonal variance, diagonal moment, second diagonal moment, product moment, 
triangular symmetry, standard deviation, cell classification (l^Hypodipioid, 2=Diploid, 
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3=S-Phase, 5=Tetraploid, 6=Hyperploid), perimeter, DNA index, density, average optical 
density, feret X. feret Y, maximum diameter, minimum diameter, elongation, run length, 
and configurable run length). For the purposes of this invention, these parameters are 
collectively referred to as prognostic parameters. 

5 

Clinical samples obtained from patients at risk for occurrence or recurrence of 
prostate cancer are analyzed and values generated for various prognostic parameters, 
including the nuclear morphometric descriptors (NMD*s). The univariant or multivariant 
NMD*s are then used to calculate a Quantitative Nuclear Grade (QNG) for the case, using a 

10 formula derived from a logistic regression model. Summary statistics from the NMD's (e.g. 
standard deviation and variance) and raw data from the biomarkers are analyzed using 
logistic regression. The logistic regression may be applied in a univariate or multivariate 
mode. As used herein, multivariate analysis means that several univariately significant 
independent variables are jointly regressed on the same dependent variable. A dependent 

15 variable refers to a clinical outcome (e.g. regional tumor recurrence, distant metastasis, or 
death), or a pathological disease sute (e.g. organ confinement status). Based upon 
significance levels, the statistical program selects only those univariately significant 
independent variables that contribute to the correct prediction of the dependent variable (e.g. 
progression or organ confined disease status). Notable is the fact that fumre changes in the 

20 model, such as measurement of NMD*s (e.g. different magnifications for collection, 

improved camera resolution, etc.) and\or additional biomarkers, may change the parameters 
needed and only improve upon the predictive power of the models by small percentages so it 
may approach 100%. 

25 It is therefore an object of the present invention to provide a method for predicting 

the loss of organ confinement that comprises first providing prostate cells from a subject, 
then analyzing various prognostic parameters in the prostate cells, the prognostic parameters 
including nuclear morphometric descriptors and selected biomarkers, and predicting the loss 
of organ confinement by statistical analysis of the predictive parameters. The predictive 
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parameters include nuclear morphometric descriptors that are univariate or multivariate 
significant, and wherein the nuclear morphometric descriptors may be selected from the 
group consisting of difference entropy, information measure A. information measure B. 
maximal correlation coefficient, angular second moment, coefficient of variation, contrast, 
5 peak transition probability, correlation, diagonal variance, difference moment, diagonal 
moment, inverse difference moment, second diagonal moment, sum average, product 
moment, sum variance, triangular symmetry, sum entropy, blobness, entropy, difference 
variance, and standard deviation. 

10 In other embodiments, the nuclear morphometric descriptors may also be selected 

from the group consisting of object sum, optical density,, object size, object shape, picograms 
of DNA, angular second moment, contrast, correlation, difference moment, inverse 
difference moment, sum average, sum variance, sum entropy, entropy, difference variance, 
difference entropy, information measure A, information measure B, maximal correlation 

IS coefficient, coefficient of variation, peak transition probability, diagonal variance, diagonal 
moment, second diagonal moment, product moment, triangular symmetry, standard 
deviation, cell classification (1 =HypodipIoid, 2 = Diploid, 3 = S-Phase, 5=TetrapIoid, 
6=Hyperploid), perimeter, DNA index, density, average optical density, ferei X, feret Y. 
maximum diameter, minimum diameter, elongation, run length, and configurable run length. 

20 The nuclear morphometric descriptors may also be Markovian nuclear texmre factors. 

Biomarkers that are contemplated within the scope of the instant invention include, 
but are not limited to Gleason score, number of positive sextant core biopsies, sum % area 
tumor involvement, and mmor location. Moreover, it is also contemplated that PSA 
25 antigenicity and RT-PCR mRNA levels of selected markers is within the scope of the 
invention. 

The invention also provides a method of predicting the loss of organ confinement 
before radical prostatectomy that comprises tlie steps of first, obtaining cells from a subject. 
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then analyzing selected predictive parameters in the cells, followed by predicting the 
recurrence of prostate cancer in the cell samples by statistical analysis of the predictive 
parameters. These predictive parameters include nuclear morphometric descriptors or selected 
biomarkers, and the statistical analysis may be univariate or multivariate statistical analysis. 

The predictive parameters arc selected from the group consisting of Gleason score, 
nuclear morphometric descriptors, serum PSA, number of positive sextant core biopsies, 
mmor location DNA ploidy, sum % area of tumor involvement, or RT-PCR mRNA levels. 
As in previous embodiments, the nuclear morphometric descriptors are selected from the 
group consisting of difference entropy, information measure A, information measure B, 
maximal correlation coefficient, angular second moment, coefficient of variation, contrast, 
peak transition probability, correlation, diagonal variance, difference moment, diagonal 
moment, inverse difference moment, second diagonal moment, sum average, product 
moment, sum variance, triangular synmietry, sum entropy, blobness, entropy, difference 
variance, and standard deviation. 

In other embodiments, the nuclear morphomeuric descriptors are selected from the 
group consisting of object sum, optical density, object size, object shape, picograms of 
DNA, angular second moment, contrast, correlation, difference moment, inverse difference 
moment, sum average, sum variance, sum entropy, entropy, difference variance, difference 
entropy, information measure A, information measure B, maximal correlation coefficient, 
coefficient of variation, peak transition probability, diagonal variance, diagonal moment, 
second diagonal moment, product moment, triangular symmetry, standard deviation, cell 
classification (1 =Hypodiploid, 2=Diploid, 3=S-Phase, 5=Tetraploid, 6=Hyperploid), 
perimeter, DNA index, density, average optical density, feret X, feret Y, maximum 
diameter, minimum diameter, elongation, run length, or configurable run length. 
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It is contemplated that the statistical analysis used in the present invention may be 
univariate or multivariate analysis, and further the statistical analysis may be performed by a 
neural network. 

In an exemplary embodiment, the instant invention is a method of predicting the loss 
of organ confinement in which cells are provided from a subject, the prognostic parameters 
include nuclear morphometric descriptors (including object sum, optical density, object size, 
object shape, picograms of DNA, angular second moment, contrast, correlation, difference 
moment, inverse difference moment, sum average, sum variance, sum entropy, entropy, 
difference variance, difference entropy, information measure A, information measure B, 
maximal correlation coefficient, coefficient of variation, peak transition probability, diagonal 
variance, diagonal moment, second diagonal moment, product moment, triangular symmetry, 
standard deviation, cell classification (1-Hypodiploid, 2=Diploid, 3-S-Phase, 
5=TetrapIoid, 6=HyperpIoid). perimeter, DNA index, density, average optical density, feret 
X, feret Y, maximum diameter, minimum diameter, elongation, run length, or configurable 
run length), PSA values, the number of positive sextant core biopsies, tumor location 
(whether base, mid, or apex involvement), the sum percent area of mmor involvement, 
Gleason score, and DNA ploidy. From statistical analysis of these parameters (such as 
multivariate analysis or neural networks) the loss of organ confinement may be predicted. In 
a funher embodiment, the predictive parameter RT-PCR analysis of mRNA is included with 
the other parameters. 

As used herein, a receiver operating characteristic (ROC) curve plots an independent 
variable's sensitivity (true positive fraction) on the y-axis against l-specificit>- (the false 
25 positive fraction) on the x-axis as the cutoff value for a predicted positive observation is 

varied. A positive observation means that the predicted probability is greater than or equal 
to an investigator selected cutoff value. The ROC curve or plot is useful for determining the 
sensitivity, specificity, and negative and positive predictive values of a single test or a 
multiparameter test. In addition, the ROC curve can be used to establish the optimum 
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threshold cutoff for a continuous variable. When quamitating the area under a ROC curve, 
an area of 1.0 means a perfect predictive value, while an area of 0.5 means no predictive 
value and is due to random chance alone. 

For the purposes of the invention, sensitivity is the fraction of observed positive cases 
that are correctly classified and equals: true positives {true positives + false negatives}. 
The positive predictive value equals: true positives h- {true positives + false positives}. 

Specificity is the fraction of observed negative cases that are correctly classified and 
equals: true negatives -r- {true negatives + false positives}. The negative predictive value 
equals: true negatives -r {true negatives + false negatives}. 

As used herein, Markovian analysis means a process by which an image (pattern 
space) is transformed into a transitional-probability space. The Markovian approach to 
texture measurement treats images as stochastic processes in the form of discrete Markovian 
fields, yielding matrices of gray-level transition probabilities. These probabilities are arrays 
of numbers that describe digital images in terms of the probability of occurrence of different 
gray levels and sequences of gray levels. One embodiment of the invention utilizes 13 
Markovian factors, 4 DNA content factors, and 5 shape factors. 

Texture is an important visual feature for many pattern recognition tasks. As used 
herein, texture describes the interdependent characteristics of pixels within a neighboring 
area. Regular texture has more or less periodic patterns, while random texture is best 
described by its "coarseness". A texmre parameter is a local statistic, meaning that the 
statistic is computable from a set of neighboring picnire points in a neighborhood that is 
small compared to the number of picture points in the entire region. 

A Markovian matrix of the present invention is constructed using a cell nucleus image 
using QDA v3.0 software on a CAS-200 Image Analysis System. Using the QDA 
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Morphology mode of CMP v3.0 and Cellsheet v.l.Od software allows the measurement and 
calculation of the features listed in Table I. Moreover, CMP v3.0 and Cellsheet*^ v.l.Od 
calculate 22 different Markovian nuclear descriptors based upon the gray-level transition 
probabilities of Feulgen stained nuclei. The Cellsheet* software may be sued to calculate the 
feanires listed in Tables I and II, respectively. It is further recognized that any program, in 
addition to CMP and Cellsheet® that calculates texture features may; be used within the 
scope of the present invention. 

The step size selected may range from 1 pixel to 256 pixels. The step size defines 
the size of the "grain" or picture point in number of pixels that is to be compared to the 
neighboring picture points. Each cell nucleus image is normalized by partitioning the image 
into eight equally frequent gray level ranges, each range consisting of an equal number of 
pixels. In other embodiments, the normalization may be done by partitioning the image into 
16, 32, 64, or any number of gray levels. This normalization process is done by first 
plotting each individual pixel optical density (gray level) that is above an operator set 
threshold against the number of pixels. This plot is divided into eight equally frequent gray- 
level ranges (optical density ranges); each range containing an equal number of pixels (FIG. 
1). This yields a normalized cell nucleus image consisting of pixels with gray-level values 
ranging from 0-7. 
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Table I 



10 



15 



20 



25 



Nuclear Morphometric Descriptors Measured Using 
CMP v3.0 in the QDA Morphology Mode. 



1 . (OBSD) Object Sum OD 

2. (OBSZ) Object Size 

3. (OBSH) Object Shape 

4. Picograms of DNA 

5. (TXAOOl) Angular Second Moment 

6. (TXBOOl) Contrast 

7. (TXCOOl) Correlation 

8. (TXDOOl) Difference Moment 

9. (TXEOOl) Inverse Difference Moment 

10. (TXFOOl) Sum Average 

11. (TXGOOl) Sum Variance 

12. (TXHOOl) Sum Entropy 

13. (TXIOOl) Entropy 

14. (TXJOOl) Difference Variance 



15. (TXKOOl) Difference Entropy 

16. (TXLOOl) Information Measure A 

17. (TXMOOl) Information Measure B 

18. (TXNOOl) Maximal Correlation Cdeff. 

19. (TXOOOl) Coefficient of Variation 

20. (TXPOOl) Peak Transition Probability 

21. (TXQOOl) Diagonal Variance 
(TXROOl) Diagonal Moment 
(TXSOOl) Second Diagonal Moment 
CTXTOOI) Product Moment 

25. (TXUOOl) Triangular Symmetry 

26. (TXVOOl) Blobness 

27. (TXW) Standard Deviation 

28. Cell Classification (l=Hypodiploid, 

2=Diploid, 3=S-Phase, 5=Tetraploid. 
6=Hyperploid) 



22 

23 
24 



NOTE: Values 5*26 are grain dependent Markovian texture features. Grain may 
be looked at as a measurement in pixels of the width of an average sized 
object. The grain values for all of these measurements were set to 1. 
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As a further step, an 8 X 8 gray-level transition matrix is constructed from the 
normalized cell nucleus image by comparing the gray-levels of neighboring picture points 
(e.g. if a given picture point has a normalized value of 4, and its neighboring picture point 
has a normalized value of 3, an entry is made in the matrix at location Row-4 and Column-3, 
and all of the entries at this location are summed). This matrix is then transformed into an 8 
X 8 conditional gray-level transition probability matrix by dividing every matrix entry by the 
total number of pixels in the cell nucleus image. This "Markovian" probability matrix 
(Equation 1) is then used to compute the 22 Markovian texture features (Table I). 

Equation 1 8X8 Conditional Grev-level Transition Probabilitv Matrix (Markovian) 



More recently, JVB Imaging (Elmhurst, IL) has written a software application called 
ILM Morphometry vl.O that can be applied to listmode files (*.ILM) generated using a CAS- 
200 Image Analysis System. This program therefore allows the measurement and calculation 
of the same features as the CMP v3.0 software (Markovian and DNA Content feamres) as 
well as eight additional features (listed in Table II). The inventors have tested this new 
software on the patient sample reported in this invention and obtained similar statistical 
model performance as with the CMP v3.0 software. A newer version of this program, tilled 
Cellsheet® vl.Od, has been used in later studies. 



M = 



I Pl(o/o) Pl(0/i) p^(pm 

Pl(1/0) Pl(1/1) Pl(1/7) I . 




where each matrix element PJi/j) is defined as the conditional 
probability of gray-level / occurring L picmre points after gray-level J 
occurs, where L is defined as the step size (or size in pixels of the 
picture point). 
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Table II 



Nuclear Morphometric Descriptors Measured Using ILM Morphometry vl.O 



10 



15 



20 



25 



30 



1. 


Object Sum Optical Density 


19. 


Coefficient of Variation 


2. 


Object Size 


20. 


Peak Transition Probability 


3. 


Object Shape 


21. 


Diagonal Variance 


4. 


Picograms of DNA 


22. 


Diagonal Moment 


5. 


Angular Second Moment 


23. 


Second Diagonal Moment 


6. 


Contrast 


24. 


Product Moment 


7. 


Correlation 


25. 


Triangular Symmetry 


8. 


Difference Moment 


26. 


Standard Deviation 


9. 


Inverse Difference Moment 


27. 


Cell Classification 




i 




(1 =HvpodiDloid. 2=DiDloid. 3==S 








Phase, 5=Tetraploid, 








6=HyperpIoid) 


10. 


Sum Average 


28. 


Perimeter 


11. 


Sum Variance 


29. 


DNA Index 


12. 


Sum Entropy 


30. 


Density 


13. 


Entropy 


31. 


Average Optical Density 


14. 


Difference Variance 


32. 


Feret X 


15. 


Difference Entropy 


33. 


Feret Y 


16. 


Information Measure A 


34. 


Maximum Diameter 


17. 


Information Measure B 


35. 


Minimum Diameter 


18. 


Maximal Correlation Coefficient 


36. 


Elongation 


NOTE: Values 5-26 are grain dependent Markovian texture features. Grain may 



be looked at as a measurement in pixels of the width of an average sized 
object. The grain values for all of these measurements were set to 1. 
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In exemplary embodiments of the invention, consideration must be given to several 
parameters, such as cell selection, magnification, pixel shape and size, camera resolution 
(number of pixels in the x and y dimension, e.g. number of pixels per finr), and Markovian 
step size, which can significantly alter nuclear morphomeiric descriptor outputs (FIG. 2). 
The NMD's have been demonstrated in this invention to be significant independent variables 
in the prediction of tumor progression and organ confined disease status. As previously 
indicated, the method for cell selection is critical because of the need to sample biologic 
tumor heterogeneity. Additionally, the total number of NMD's required to predict an 
outcome is decreased as the magnification increases (Table III; also see FIGS. 3 - 6), as well 
as significant changes in the individual NMD's required. The latter is due to an increase in 
the number of pixels per /xm^ that would enhance the resolution of calculations for the 
NMD's. Also notable, the predictive power for all outcomes using the NMD component of 
the model increases as the magnification increases. 
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Magnification Effects (40X vs. 63X) upon Number of Nuclear 
Morphometric Descriptors needed to Accurately Predict 
Progression in a Subset of 10 Progressors and 10 Non- 

Progressors 





40X 

CMP v3.0* 1 JVB vl.O** 


63 

CMP v3.0* 1 


X 

JVB vl.O** 


P Value Cutoff 


0.45 


0.50 


0.25 


0.70 


Sensitivity 


90.00% 


90.00% 


100.00% 


90.00% 


Positive Pred. Value 


90.00% 


100.00% 


76.92% 


100.00% 


Specificity 


90.00% 


100.00% 


70.00% 


100.00% 


Negative Pred. Value 


90.00% 


90.91% 
0 


100.00% 
3 


90.91% 
0 


# False Positives 

# False Negatives 
Area Under ROC Curve 


1 1 

L^^ 

1 0.9400 


1 

0.9500 


0 

0.9400 


1 

0.9600 


NMD's in Model 

# Concordant NMD's 
1 # Discordant NMD's 


6 Markovian 
& Area 

.4 


5 Markovian 

6 Sum O.D, 

Area, 
Perimeter 

3 

1 5 


5 Markovian 
& Area 

3 


4 Markovian 
& Sum O.D. 

3 

2 



* Number of CMP Nuclear Morphometric Desctlptors (NMD's) - 2 
** Number of JVB Nuclear Morphometric Descriptors (NMD's) - 3( 
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In other embodimenis, either non-parametric statistical methods, or standard or 
proprietary neural networks were used to validate the utility of NMD's and biomarkers for 
the prediction of two possible outcomes, progression and organ confined disease status. 
5 Using a clinically and pathologically well-defined retrospective patient sample diagnosed with 
localized prostate cancer, logistic regression methods were applied on a well defined patient 
sample (n = 124) (Table IV) to determine which NMD's and biomarkei-s were capable (e.g. 
statistically significant) of predicting either progression or organ confined disease status. 

10 LOGISTIC REGRESSION PROGRESSION MODEL: 

The invention applies logistic regression to select the univariately significant variables 
for progression (Table V) using the ST ATA™ statistical software package (STATA™ 
command: logistic). Next, these univariately significant variables are multivariately assessed 
using backwards stepwise logistic regression (STAT A™ command: swlogis) lo determine 

15 which independent variables (e.g. NMD's (CMP or JVB), Gleason Score, and biomarkers) 
are retained to predict progression (Tables VI. Via, VII, and Vila). 
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Table V 

Univariate Analysis for Progression Prediction using STATA™ Logistic Regression 
Independent Variable Statistical Signiflcance (p Value) 



10 


Posi Operative Gleason Score 


p £ 0.00001 




Nuclear Roundness Variance* 


p £ 0.00001 




Best CAS-200 CMP v3.0 Nuclear Morphometric Descriptors 


p < 0.00001 


15 








Best CAS-200 JVB vl .0 Nuclear Morphometric Descriptors 


. p ^ 0.00001 




CAS-200 DNA Ploidy - 1 (C-DNAl) 


p = 0.0080 


20 


CAS-200 DNA Ploidy - 10 (C-DNAIO) 


p = 0.0274 




CAS-200 DNA Ploidy - JIE (JHHDNAIO) 


p = 0.0109 




Her-2/neu Antigenicity: Focal, Diffuse, Negative (H2NFDN) 


p = 0.0147 


25 








PCNA Antigenicity 


p = 0.1600** 




PD-41 Antigenicity above a 5% cutoff 


p = 0.3045** 


30 







* Measured using DynaCell™ system at Johns Hopkins Hospital 

Not Statistically Significant. Must be less than 0.0500 to be statistically 
significant. 
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In another embodiment, the ability of Her-2/neu antigenic expression to identify high 
risk sub-populations of well to moderately differentiated Gleason grades (2 - 6) as well as 
high Gleason grades (^7) is clearly demonstrated in FIG. 19, Additionally, it was 
demonstrated that non-diploid DNA ploidy status selected out a subset of well to moderately 
differentiated Gleason grades (2 - 6) that were at risk for progression (FIG. 20). 

LOGISTIC REGRESSION ORGAN CONFINEMENT MODEL: 

This invention also applies logistic regression to select the univariately significant 
variables for organ confinement (Table VIII) using the STAT A™ statistical software package 
(STATA'^'^ command: logistic). Next, these univariately significant variables are 
multiyariaiely assessed using backwards stepwise logistic regression (ST ATA™ command: 
swlogis) to determine which independent variables (e.g. NMD's (CMP or JVB), Gleason 
Score, and biomarkers) are retained to predict organ confinement (Tables IX, Ixa. X, and 
Xa). 
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Table VIH 



Univariate Analysis for Organ Confined Disease Status Prediction using STATA™ 

Logistic Regression 



Independent Variable Statistical Significance (p Value) 



Post Operative Gleason Score 


n 

r 


< 0.00001 


Nuclear Roundness Variance* 


n 

y 


= 0.0073 


Best CAS-200 CMP v3.0 Nuclear Morphometric Descriptors 


p 


= 0.0005 


Best CAS-200 JVB vl.O Nuclear Morphometric Descriptors 


p 


= 0.0003 


CAS-200 DNA Ploidy - 1 (C-DNAl) 


p 


= 0.0703** 


CAS-200 DNA Ploidy - 10 (C-DNAIO) 


p 


= 0.0546** 


CAS-200 DNA Ploidy • JIE (JHHDNAIO) 


p 


= 0.0499 


Her-2yneu Antigenicity: Focal, Diffuse, Negative (H2NFDN) 


p 


= 0.0023 


PCNA Antigenicity 


p 


= 0.1330** 


PD-41 Antigenicity 


p 


= 0.0198 



* Measured using DynaCell™ system at Johns Hopkins Hospital 

Not Statistically Significant. Must be less than 0.0500 to be statistically 
35 significant. 
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METHODS TO OBTAIN PATDENT^SPECIFIC RESULTS: 



Logistic Regression Method - STATA^" provides a command (hgit, an 
estimated maximum-likelihood logit model) that provides the weighted coefficients for the 
statistically significant independent variables used in the multivariate model and the model 
constant. The general formulas for calculating the predictive index and predictive probability 
are as follows: 



Predictive Index (xb) = (jSo + /3ivar(l) + /3,var(2) + + 

jS,var(;2)) 



Predictive Probability (p) = e'^ / (1 -f e'^'*) 

Where: 

j3o = Formula Constant 
01 through 0„ - Weight factors for variables 1 

through n 

var(l) through var(n) = Independent variables being used in logistic regression 

model. 



The final calculation of the predictive probability provides a patient-specific value, between 0 
and 1. for the probability of a specific outcome (e.g. progression or organ confined disease 
status). The threshold value (cutoff) for the predicted probability is selected based upon the 
results of the ROC curves. Table XI illustrates patient specific NMD and multi-parameter 
(combined) predictive probabilities calculated using this method. 
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Use of Logistic Regression (logit) to 
Predict Patient-Specific Outcomes 















Combined 


Predicted 












Morphometry 


Parameters 


Outcome 












Predictive 


Predictive 


(Cutoff: 


Case I.D. 


PD41-5 


C-DNAl 


H2NFDN 


PostGL 


Probability 


Probabilities 


0.50) 


33 


+ 


Diploid 


Focal 


5 


0.14 


0.02 


0 


34 




Ab: >S+G2M 


Diffuse 


7 


0.92 


0.93 


1 


149 




Aneuploid 


Focal 


6 


0.98 


0.58 


1 


9952 




Hyppdiploid 


Diffuse 


4 


0.09 


0.00 


0 
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Neural Networks - The first network configuration to be considered was a standard 
multilayer sigmoidal network with a single hidden layer. The neural network input layer 
consisted of either 15, 28, or 30 input nodes to accommodate the input data set of either 15, 
28, or 30 measurements (NMD's and Gleason Score). The activation function in each 
5 hidden layer and output layer neuron is sigmoid. Different network configurations (number 
of hidden layer neurons) with various training termination conditions were tested. FIG. 29 
illustrates the neural network configuration used in this study. Ii was found, for the given 
training set, that the neural network classifier works best with 20 hidden layer neurons and 
when the training is terminated at approximately 1000 iterations. 

10 

The second network, configuration tested consisted of a single hidden layer as well. 
However, the non-linearity function used was the sinusoidal function (proprietary Hybrid 
network). The output layer neurons still used the sigmoidal transfer function. It had the 
same structure as the first network (see FIG. 29). A number of different frequencies were 
15 tested to find the best combination. The best frequency was found to be F =^ 0.2. 

All networks (standard and hybrid with 15, 28, or 30 inputs) were tested using the ten 
different combinations of randomly selected test (18) and training (106) cases. The threshold 
value (cutoff) used in all cases was 0.5. Table XII summarizes the results using the standard 

20 sigmoidal neural networks applied to the n=124 patient sample described in this invention 

(Tables Xlla, b. & c). Notable is the faci that this network degrades as the number of input 
feamres is increased from 15 to 28 to 30. Please note that the 28 and 30 feamre networks 
did not undergo pre-selection using logistic regression methods. The best perfonming 
network was the one labeled "15 Input Features", where the features were pre-selected based 

25 upon statistical significance using logistic regression methods. Therefore, the use of 

statistical methods to pre-select statically significant features improves network performance. 
Also it is evident from the variation in the predictive rates among the ten trained networks 
that if the number of patients is increased, the performance of the network should be 
significantly improved. 
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Sigmoidal Neural Network Comparisons: JHH-1 (n = 124) 





Progression Predictive Rates 


Network # 


15 Feature NN 


28 Feature NN 


30 Feature NN 


1 


78% 


61% 


67% 


2 


83% 


61% 


61% 


3 


89% 


50% 


50% 


4 


67% 


83% 


67% 


5 


67% 


72% 


72% 


6 


83% 


78% 


67% 


7 


89% 


78% 


72% 


8 


78% 


78% 


72% 


9 


72% 


78% 


78% 


10 


89% 


72% 


72% 




Mean 


79% 


71% 


68% 


Standard Error 


3% 


3% 


2% 


Median 


81% 


75% 


70% 


Mode 


78% 


78% 


72% 


Standard Deviation 


9% 


10% 


8% 


Variance 


1% 


1% 


1% 
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Sigmoidal Neural Network: 15 Input Features 









% Error 


% Error Non- 


Network # 


1 Predictive Rate 


Total Error 


Progressors 


Pro gressors 


1 


78% 


22% 


33% 


11% 


2 


83% 


17% 


33% 


0% 


3 


89% 


11% 


22% 


0% 


4 


67% 


33% 


55% 


11% 


5 


67% 


33% 


55% 


11% 


6 


83% 


17% 


22% 


11% 


7 


89% 


11% 


22% 


0% 


8 


78% 


22% 


33% 


11% 


9 


72% 


28% 


33% 


22% 


10 


89% 


11% 


11% 


11% 




Mean 


80% 


21% 


32% 


9% 


Standard Error 


3% 


3% 


4% 


2% 


Median 


81% 


20% . 


33% 


11% 


Mode 


89% 


11% 


33% 


11% 


Standard Deviation 


9% 


9% 


14% 


7% 


Variance 


1% 


1% 


2% 


0% 
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Sigmoidal Neural Network: 28 Input Features 









/o Jti«rror 


vo Jbrror iNon- 


Network # 


Predictive Rate 


Total Error 


Progressors 


Progressors 


I 


61% 


39% 


55% 


22% 


2 


61% 


39% 


55% 


22% 


3 


50% 


50% 


67% 


33% 


4 


83% 


17% 


33% 


0% 


5 


72% 


28% 


44% 


11% 


6 


78% 


22% 


33% 


11% 


7 


78% 


22% 


33% 


11% 


8 


78% 


22% 


33% 


11% 


9 


78% 


22% 


44% 


0% 


10 


72% 


28% 


44% 


11% 










Mean 


71% 


29% 


44% 


13% 


Standard Error 


3% 


3% 


4% 


3% 


Median 


75% 


25% 


44% 


11% 


Mode 


78% 


22% 


33% 


11% 


Standard Deviation 


10% 


10% 


12% 


10% 


Variance 


1% 


1% 


1% 


1% 
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Sigmoidal Neural Network: 30 Input Features 









% Error 


% Error Non- 


Nehvork # 


Predictive Rate 


Total Error 


Progrcssors 


Progressors 


1 


67% 


33% 


44% 


22% 


2 


61% 


39% 


67% 


11% 


3 


50% 


50% 


67% 


33% 


4 


67% 


33% 


67% 


0% 


5 


72% 


28% 


44% 


11% 


6 


67% 


33% 


44% 


22% 


7 


72% 


28% 


44% 


11% 


8 


72% 


28% 


33% 


22% 


9 


78% 


22% 


44% 


0% 


10 


72% 


28% 


44% 


11% 






Mean 


68% 


32% 


50% 


14% 


Standard Error 


2% 


2% 


4% 


3% 


Median 


70% 


31% 


0% 


11% 


Mode 


72% 


28% 


0% 


11% 


Standard Deviation 


8% 


8% 


12% 


10% 


Variance 


1% 


1% 


2% 


1% 
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Table XIII illustrates the results for the Hybrid neural network using the n= 124 
patient sample described in this invention (Tables Xllla, b, & c). The same observations as 
made for the standard sigmoidal network above apply to the Hybrid neural network. In 
conclusion, the use of appropriately trained standard or Hybrid neural networks can be used 
to predict patient specific outcomes (e.g. progression or organ confined disease status). 

When predicting organ confined disease, one would first provide a neural network; 
then train the neural network using a first set of prognostic parameters obtained from cells 
known to lose organ confinement and a second set of prognostic parameters obtained from 
cells known not to lose organ confinement, followed by analyzing prognostic parameters in 
tumor cells of an individual having an unknown state of organ confinement and predicting the 
loss of organ confinement in cells of the individual having an unknown state of organ 
confinement using the prognostic parameters and the trained neural network. 
The prognostic parameters may be the number of positive sextant core biopsies, sum area of 
mmor involvement, and quantitative nuclear grade. In other embodiments, the prognostic 
parameters may be serum PSA, tumor location, number of positive sextant core biopsies, 
sum area of tumor involvement, quantitative nuclear grade, DNA ploidy. and post operative 
Gleason score. 

The neural network may employ the prognostic parameters that include the 
quantitative nuclear grade group comprising difference entropy, information measure A. 
information measure B, maximal correlation coefficient, angular second moment, coefficient 
of variation, contrast, peak transition probability, correlation, diagonal variance, difference 
moment, diagonal moment, inverse difference moment, second diagonal moment, sum 
average, product moment, sum variance, triangular symmetry, sum entropy, blobness, 
entropy, difference variance, and standard deviation. In still further embodiments, the 
quantitative nuclear grade parameters are object sum, optical density, object size, object 
shape, picograms of DNA, angular second moment, contrast, correlation, difference moment, 
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inverse difference moment, sum average, sum variance, sum entropy, entropy, difference 
variance, difference entropy, information measure A, information measure B, maximal 
correlation coefficient, coefficient of variation, peak transition probability, diagonal variance, 
diagonal moment, second diagonal moment, product moment, triangular symmetry, standard 
deviation, cell classification (l=Hypodiploid, 2=DipIoid, 3=S-Phase, 5:=Tetrapioid, 
6=Hyperploid), perimeter, DNA index, density, average optical density, feret X. feret Y, 
maximum diameter, minimum diameter, elongation, run length, and configurable run length. 

The neural network of the instant invention may also include additional biomarkers, 
such as the determination of RT-PCR mRNA levels. The network may be of the back 
propagation type, or it may be of the hybrid type. 

Classification and Regression Trees (CART) - 
Application of recursive partitioning methods using the SYSTAT CART™ for DOS vl.02 
(Evanston, IL) software program was performed. This method is another example of a non- 
parametric statistical classifier. The use of this method yields similar classification results 
using the well defined patient sample (h=124), and can generate a patient specific outcome 
using a trained CART. Those experienced in the art of non-parametric statistical classifiers 
realize that several other such methods exist and can be applied to achieve this same end. 
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Hybrid Neural Network Comparisons: JHH-1 (n = 124) 





Progression Predictive Rates 


Network # 


15 Feature NN 


28 Feature NN 


30 Feature NN 


1 


67% 


72% 


67% 


2 


67% 


56% 


67% 


3 


83% 


56% 


56% 


4 


67% 


56% 


67% 


5 


72% 


67% 


78% 


6 


89% 


72% 


72% 


7 


89% 


67% 


78% 


8 


78% 


72% 


67% 


9 


72% 


72% 


67% 


10 


78% 


67% 


78% 








Mean 


76% 


68% 


70% 


Standard Error 


3% 


1% 


2% 


Median 


75% 


67% 


67% 


Mode 


67% 


67% 


67% 


Standard Deviation 


9% 


4% 


7% 


Variance 


1% 


0% 


0% 
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Hybrid Neural Network: 15 Input Features 



Network # 


Predictive Rate 


Total Error 


% Error 
Progressors 


% Error Non- 
Progressors 


1 


67% 


33% 


44% 


22% 


2 


67% 


33% 


55% 


11% 


3 


83% 


17% 


22% 


11% 


4 


67% 


33% 


44% 


22% 


5 


72% 


28% 


44% 


11% 


6 


89% 


11% 


22% 


0% 


7 


90% 


11% 


22% 


0% 


8 


78% 


22% 


33% 


11% 


9 


72% 


28% 


33% 


22% 


10 


78% 


22% 


22% 


22% 



Mean 


76% 


24% 


34% 


13% 


Standard Error 


3% 


3% 


4% 


3% 


Median 


75% 


25% 


33% 


11% 


Mode 


67% 


33% 


22% 


22% 


Standard Deviation 


9% 


9% 


12% 


9% 


1 Variance 


1% 


1% 


1% 


1% 
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Hybrid Neural Network: 28 Input Features 



Network # 


Predictive Rate 


Total Error 


% Error 

Progressors 


% Error Non- 
Progressors 


1 


72% 


28% 


33% 


22% 


2 


56% 


44% 


55% 


33% 


3 


56% 


44% 


67% 


22% 


4 


56% 


44% 


67% 


22% 


5 


67% 


33% 


44% 


22% 


6 


72% 


28% 


44% 


11% 


7 


67% 


33% 


55% 


11% 


8 


72% 


28% 


33% 


22% 


9 


72% 


28% 


44% 


11% 


10 


67% 


33% 


55% 


11% 



Mean 


66% 


34% 


50% 


19% 


Standard Error 


2% 


2% 


4% 


2% 


Median 


67% 


33% 


50% 


22% 


Mode 


67% 


28% 


55% 


11% 


Standard Deviation 


7% 


7% 


12% 


7% 


Variance 


1% 


1% 


1% 


1% 
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Hybrid Neural Network: 30 Input Features 









% Error 


% Error Non- 


Network if 


Predictive Rate 


Total Error 


, Progressors 


Progressors 


1 


67% 


33% 


44% 


22% 


2 


67% 


33% 


44% 


22% 


3 


56% 


44% 


78% 


11% 


4 


67% 


33% 


55% 


11% 


5 


78% 


22% 


33% 


11% 


6 


72% 


28% 


. 33% 


22% 


7 


78% 


22% 


22% 


22% 


8 


67% 


33% 


44% 


22% 


9 


67% 


33% 


55% 


11% 


10 


78% 


22% 


33% 


11% 




Mean 


70% 


30% 


44% 


17% 


Standard Krror 


2% 


2% 


5% 


2% 


Median 


67% 


33% 


44% 


17% 


Mode 


67% 


33% 


44% 


22% 


Standard Deviation 


7% 


7% 


16% 


6% 


Variance 


0% 


0% 


2% 


0% 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The following drawings form part of the present specification and are included to 
further demonstrate certain aspects of the present invention. The invention may be better 
understood by reference to one or more of these drawings in combination with the detailed 
description of specific embodiments presented herein, 

FIG. 1 - Normalization plot of each individual pixel optical density (gray level), 
divided into eight equally frequent gray-level ranges (optical density ranges); each range 
containing an equal number of pixels 

FIG. 2 - Parameters including cell selection, magnification, pixel shape and size, 
camera resolution, and Markovian step size that can significantly alter nuclear morphometric 
descriptor outputs 

FIG. 3 shows the predictive power using the combined CMP NMD's measured with a 
40X objective. Using 7 different CMP NMD*s, a ROC curve was produced with an area 
under the curve of 94%. 

FIG. 4 shows the predictive power using the combined JVB NMD's measured with a 
40X objective. Using 8 different JVB NMD's, a ROC curve was produced wiih an area 
under the curve of 95%. 

FIG. 5 shows the predictive power using the combined CMP NMD*s measured with a 
63X objective. Using 6 different CMP NMD's, a ROC curve was produced with an area 
under the curve of 94%. 
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FIG. 6 shows the predictive power using the combined JVB NMD's measured with a 
63X objective. Using 5 different JVB NMD's. a ROC curve was produced with an area 
under the curve of 96%. 

FIG. 7 - Illustrates the predictive power of Post Operative Gleason Score as a single 
independent variable to predict progression. Please refer to Column A of Table VI. 

FIG. 8 - Demonstrates the ability of the Post Operative Gleason Score alone to 
stratify progressors and non-progressors using a Kaplan-Meier Survival (Recurrence) Curve. 
This is the pathologic standard against which all models are tested. 

FIG. 9 - Illustrates the predictive power of Nuclear Roundness Variance (as measured 
by the DynaCell System at lOOX) to predict progression. Please refer to Column B of Table 
VI. 

FIG. 10 - Demonstrates the ability of Nuclear Roundness Variance alone to stratify 
progressors and non-progressors using a Kaplan-Meier Survival (Recurrence) Curve. 

FIG. 11 - Illustrates the predictive power of the 12 CMP NMD's to predict 
progression. Please refer to Column D2 of Table VI. 

FIG. 12 - Demonstrates the ability of the 12 CMP NMD's to stratify progressors and 
non-progressors using a Kaplan-Meier Survival (Recurrence) Curve. 

FIG. 13 - Illustrates the predictive power of the 13 CMP NMD's, 1 biomarker, NRV. 
and Gleason Score combined to predict progression. Please refer to Column N of Table VI. 
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FIG. 14 - Demonstrates the ability of the 13 CMP HMD's. 1 biomarker, NRV, and 
Gleason Score combined to stratify progressors and non-progressors using a Kaplan-Meier 
Survival (Recurrence) Curve. 

FIG. 15 - Illustrates the predictive power of the 19 JVB NMD*s to predict 
progression. Please refer to Column D2 of Table VII. Also, please note the difference in 
the number of features required for JVB NMD's as well as an increase the predictive power 
of the model as compared to CMP NMD*s in FIG. 10. 

FIG. 16 - Demonstrates the ability of the 19 JVB NMD 's to stratify progressors and 
non-progressors using a Kaplan-Meier Survival (Recurrence) Curve. Please note the 
difference in the ability of JVB NMD's to stratify progressors and non-progressors as 
compared to CMP NMD's in FIG. II. 

FIG. 17 - Illustrates the predictive power of the 14 JVB NMD's, 2 biomarkers, NRV, 
and Gleason Score combined to predict progression. Please refer to Column N of Table VII. 

FIG. 18 - Demonstrates the ability of the 14 JVB HMD's, 2 biomarkers, NRV, and 
Gleason Score combined to stratify progressors and non-progressors using a Kaplan-Meier 
Survival (Recurrence) Curve. 

FIG. 19 - Stratification of progressors among well to moderately differentiated 
prostate cancers using Her-2/neu antigenic expression. 

FIG. 20 - Stratification of progressors among well to moderately differentiated 
prostate cancers using DNA ploidy cytometry. 

FIG. 21 - Illustrates the predictive power of Post Operative Gleason Score to predict 
organ confined disease status. Please refer to Column A of Table IX. Note the lower 
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predictive value of this independent variable as compared to the same variable used to predict 
progression (see FIG. 6). 

FIG. 22 - Illustrates the predictive power of Nuclear Roundness Variance to predict 
organ confined disease sianas. Please refer to Column B of Table IX. Once again note the 
much lower predictive value of this independent variable compared to its contribution in 
prediction progression (see FIG. 9). 

FIG. 23 - Illustrates the predictive power of the 10 CMP HMD's to predict organ 
confined disease status. Please refer to Column D2 of Table IX. Note the significant 
improvement of the CMP HMD's alone as compared to NRV alone (FIG. 22) in the 
prediction of organ confined disease status. 

FIG. 24 - Illustrates the predictive power of the 12 CMP HMD's, 3 biomarkers, and 
NRV to predict organ confined disease status. Please refer to Column M of Table IX. 

FIG. 25 - Illustrates the predictive power of the 15 JVB NMD's to predict organ 
confined disease status. Please refer to Column D2 of Table X. Note the improvement of 
predictive power when using JVB NMD's alone as compared to CMP HMD s alone (FIG. 
22) in the prediction of organ confined disease status. 

FIG. 26 - Illustrates the predictive power of the 15 JVB NMD*s and Post Operative 
Gleason score to predict organ confined disease status. Please refer to Column H of Table 
X. 

FIG. 27 - Illustrates the predictive power of the 16 JVB NMD's, 3 biomarkers, and 
Post Operative Gleason score to predict organ confined disease status. Please refer to 
Column N of Table X. Note that NRV was dropped as a significant independent variable in 
this model. 
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FIG, 28 - Classification scheme for prostate image analysis, 
FIG. 29 - Neural network configuration. 

FIG. 30 - Pre-operative PSA sensitivities and specificities for sextant biopsy cases 
with n=227. 

FIG. 31 - Shows the predictive power using the quantitative nuclear grade with 
variance of 22 texture features, a ROC curve was produced with an area under the curve of 
73%. 

FIG. 32 - Organ confmemeni as related to Quantitative Nuclear Grade, varying 22 
textual features. 

FIG. 33 - Shows the predictive power of the system without PSA, predicting organ 
confined disease status. 

FIG. 34 - Shows the predictive power of the system with PSA, predicting organ 
confined disease status. 

FIG. 35 - Shows the predictive power of quaniiiaiive nuclear grade and Gleason 

score. 

FIG. 36 • Shows the power of the system with PSA to predict true positives, false 
positives, true negatives, and false negatives. 
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DETAILED DESCRIPTION 



The invention, in its broadest sense, is a method for predicting organ confined disease 
status or the potential for progression of prostate cancer following radical surgery using 
either non-parametric statistical analysis methods or neural networks. The parameters 
assessed by these methods include, but are not limited to, cellular biomarkers and nuclear 
morphometric descriptors. The invention provides a method to collect nuclear images and 
extract all relevant shape, size, Markovian texture, and DNA content features important to 
construction of a mathematical method that gives a single predictive probability for prostate 
cancer progression or organ localization, with or without pathological grading. The texmre 
features utilized in the present invention are set fonh in Table XV (CellSheet® v.l.Od). It is 
recognized that in predicting the probability of prostate cancer progression and organ 
localization, prognostic variable factors other than those listed may be used within the scope 
and spirit of the present invention. 

Also embodied in the present invention is the use of a trained neural network to 
provide a single predictive probability for prostate cancer progression or organ localization 
given any number of inputs. The multi-layer perceptron network of the present invention is 
a feed-forward network with one or more hidden layers of neurons between the input and 
output layers. Using this archiiecmre, many shoncomines of the single layer perceptron are 
avoided. However, because of the added complexity, the convergence theorem and weight 
adjustment procedure suggested by Rosenblatt is not applicable. An alternate procedure 
called "back propagation" has been independently developed by Werbos (Werbos, Ph.D. 
Thesis, Harvard University, 1974), Parker (Parker, Innovation Report, 581-664, File 1, 
Office of Technology Licensing, Stanford University, October, 1982), and Rumelhart (see 
Rumelhart et aL , Parallel Distributed Processing Explorations in the Microstructures of 
Cognition Vol. 1, Foundations, MIT Press, Cambridge, Mass., 1988). This procedure is 
effective and allows for efficient use of multi-layer perceptrons. But the procedure does not 
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guarantee convergence to the global minima at all times. Also, it requires a large number of 
training iterations in order to learn a given set of transformations. 



Because of the problems associated with back propagation, it is of interest to modify 
the weight adjustment procedure and/or the model developed by Rosenblatt to enable single- 
layer perceptrons to solve problems such as XOR problems. In this work, a modified 
perceptron is utilized. The modified percepiron used is a multiple threshold perceptron thai 
is capable of solving XOR problems. This modified perceptron is obtained by changing the 
non-linearity function. Unlike previous efforts in developing multiple threshold perceptrons, 
the perceptron of the present invention is capable of handling both binary and analog inputs. 
The procedure requires fewer number of iterations to develop appropriate input to output 
transformations when compared to back propagation. 

For the purposes of this invention, the following clinical and pathological staging 
criteria is used. The use of other criteria does not depart from the scope and spirit of the 
invention. 

TO - No evidence of Prostatic tumor. 

Tl - Clinically inapparent tumor, non-palpable nor visible by imaging. 

Tla - Tumor is incidental histologic finding with three or fewer microscopic 
foci. Non-palpable, with 5% or less of TURP chips (trans-urethral 
resected prostate tissue) positive for cancer. 

Tib - Tumor is incidenul histologic finding with more than three microscopic 
foci. Non-palpable, with greater then 5% of TURP chips (trans- 
urethral resected prostate tissue) positive for cancer. 
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Tic - Tumor is non-palpable, and is found in one or both lobes by needle 
biopsy diagnosis. 



T2 - Tumor is confined within the prostate. 

T2a - Tumor present clinically or grossly, limited to the prostate, tumor 

1.5cm or less in greatest dimension, with normal tissue on at least three 
sides. Palpable, half of 1 lobe or less. 

T2b - Tumor present clinically or grossly, limited to the prostate, tumor more 
than L5cm in greatest dimension, or in only one lobe. Palpable, 
greater than half of 1 lobe but not both lobes. 

T2c - Tumor present clinically or grossly, limited to the prostate, tumor more 
than 1.5cm in greatest dimension, and in both lobes. Palpable, 
involves both lobes. 

T3 - Tumor extends through the prostatic capsule. 

T3a - Palpable tumor extends unilaterally into or beyond the prostatic 
capsule, but with no seminal vesicle or lymph node involvement. 
Palpable, unilateral capsular penetration. 

T3b - Palpable tumor extends bilaterally into or beyond the prostatic capsule, 
but with no seminal vesicle or lymph node involvement. Palpable, 
bilateral capsular penetration. 
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T3c • Palpable tumor extends unilaterally and or bilaterally beyond the 

prostatic capsule, with seminal vesicle and/or lymph node involvement. 
Palpable, seminal vesicle or lymph node involvement. 

5 T4 - Tumor is fixed or invades adjacent structures other than the seminal vesicles or 

lymph nodes. 

T4a • Tumor invades any of: bladder neck, external sphincter, rectum. 
10 T4b - Tumor invades levator muscles and/or is fixed to pelvic wall. 



The following examples are included to demonstrate preferred embodiments of the 
invention. It should be appreciated by those of skill in the art that the techniques disclosed 
in the examples that follow represent techniques discovered by the inventor to function well 
15 in the practice of the invention, and thus can be considered to constitute preferred modes for 
its practice. However, those of skill in the art should, in light of the present disclosure, 
appreciate that many changes can be made in the specific embodiments that are disclosed and 
still obtain a like or similar result without departing from the spirit and scope of the 
invention. 

20 

Example I: DNA Staining Procedure Using CAS Quantitative DNA Staining Kit 
(Elmhurst, IL; Catalog #102300-01) 

Preparation of Feulgen Stain Solution: 

25 

Place 90 ml of Type I HjO in a volumetric flask and add 10 ml of IN HCL. 
Place a stir bar in a 125 ml Erlenmeyer flask and add the above solution. Add 1 vial 
of DNA stain reagent to the flask while stirring the solution. Place a rubber stopper 
in the flask, and stir the contents for at least 1 hour. This Feulgen stain solution 
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should be filtered through a Whatman No. 1 filter immediately before staining of the 
specimen. 



Preparation of Feulgen Rinse Solution: 

Place 285 ml of Type I HjO in* a 500 ml graduated cylinder and add 15 ml of 
IN HCL. Pour this solution into a 500 ml bottle. Immediately before rinsing, place 
1 vial of DNA rinse reagent into the bottle and mix the contents by swirling. This 
solution is stable for 2-3 hours. 

Preparation of Calibration slides: 

To prepare the control cells, place two (2) CAS calibration slides (Elmhurst, 
IL; Catalog #102202-00) in 10% neutral buffered formalin for 30 minutes at room 
temperature. The calibration slides are touch-prep rat hepatocytes that have a known 
shape, size, and DNA amount. Next, rinse the CAS calibration slides in running 
deionized H2O for 5 minutes. 

Preparation of Tissue Samples: 

The 5 firti formalin fixed, paraffin embedded, tissue sections are first placed 
on Probe-On™ Plus microscope slides. Place the slides in Hemo-De for 1 minute ai 
450C, and then drain the Hemo-De from the slides with the aid of absorbent paper. 
This step is repeated three (3) more times. 

Next, place the specimen slides in absolute ethanol for 1 minute at room 
temperature, and then drain the alcohol from the slides with the aid of absorbent 
paper. Repeat this step one (1) more time. 
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Finally, place the specimen slides in PBS (Ph 7.6) with 0.17c Triton X-100 for 
10 seconds at room temperature, and then drain the PBS from the slides with the aid 
of absorbent paper. Repeat this step one (1) more time. 

Feulgen Staining Procedure: 

Place the slides (CAS calibration slides and specimen slides) in 5N HCL for 1 
hour at room temperature. Next, place all of the slides in the Feulgen stain solution 
for 1 hour at room temperature (stir while staining). Drain the Feulgen stain solution 
and rinse the slides in the Feulgen rinse solution for 30 seconds at room temperamre» 
followed by rins'mg the slides in Feulgen rinse solution for 5 minutes, at room 
temperature, followed by rinsing the slides in Feulgen rinse solution for 10 minutes at 
room temperature. The slides are then rinsed in running deionized H^O for 5 
minutes. Destaining is done in 1% acid alcohol for S minutes at room temperature. 
This is followed by dipping the slides in 95% ethanol 10 times, followed by dipping 
the slides in absolute ethanol 10 times, followed by finally dipping the slides in xylene 
10 times. Place a cover slip on the slides using a toluene or xylene based mounting 
media. 

Example II: Collection and Processing of CAS-200 CMP v3.0 Nuclear Morphometric 
Descriptors (40X Objective) 

The morphometry dau from the radical prostatectomy specimens is captured using the 
Cell Measurement Program v3.0 (CMP v3.0) software from a CAS-200 Image Analysis 
System. First, a smdy is set up in CMP v3.0 using the QDA Morphology Mode. The QDA 
Morphology Mode of CMP v3.0 allows the measurement of the Sum O.D.. size, shape, cell 
class, and the 22 Markovian texture feamres (a step size of 1 was used in this invention) for 
each cell (see Table I), as well as the generation of a DNA histogram through the use of the 
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QDA v3.0 software program on the CAS-200 Image Analysis System. Once the study is set 
up, the CMP v3.0 program (under the QDA Morphology Mode) activates the QDA v3.0 
program, and the optical system is calibrated using the CAS calibration slides that were 
stained with the specimen slides. At least 20 calibration cells are measured, with a 
5 calibration peak percent coefficient of variation (%C.V.) of less than 2.0%. (NOTE: If the 
%C.V. is greater than 2.0%, a problem has occurred in the staining process.) Next, at least 
125 cancer cells are analyzed using the method described in Example IV, and the cell nuclear 
images captured from each Sfim Feulgen stained tissue section, with all of the sum 6.D., 
size, shape, and Markovian texture measurements being saved to a CMP v3.0 vector 

10 (*.VEC) file. The nuclear cell images and DNA content information are saved to a QDA 
v3.0 listmode (*.ILM) file. The CMP vector file (*.VEC) is then convened to a Lotus 
file (*.WK1) using the CMP Exporting Utility (a feamre of the CMP v3.0 software). The 
DNA content information contained in the listmode file is extracted with specially written 
software and saved to a comma delimited text file. The Lotus 1-2-3 file (*.WK1) is then 

15 transferred to a 486 PC equipped with Windows v3.1 and Excel v5.0 for Windows, and an 
Excel v5.0 macro file is used to convert the Lows 1-2-3 file (*.WK1) into separate Excel 
v5.0 files (*.XLS) for each case, each file containing the following information for every cell 
captured from that particular specimen: the sum O.D, size, shape, cell class, 22 Markovian 
texture feamres, and DNA content; (referred to collectively as CMP Nuclear Morphometric 

20 Descriptors, or CMP NMD's). Each Excel v5.0 file (*,XLS) also contains the means, 

standard deviations, variances, minima, and maxima for each CMP NMD. In addition, the 
macro creates a summary file containing the above statistics for each CMP NMD from every 
case. 

25 Example III: Collection and Processing of JVB ILM Morphometry vl.O Nuclear 
Morphometric Descriptors 

The morphometry data from radical prostatectomy specimens is captured from the 
saved listmode files (♦JLM) using the JVB ILM Morphometry vl.O software program. 
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which allows the measurement and calculation of up to 36 different features. The listmode 
files (*.ILM) are created using the QDA v3.0 software from a CAS-200 Image Analysis 
System. The optical system is calibrated using the CAS calibration slides that were stained 
with the specimen slides by measuring at least 20 calibration cells, with a calibration peak 
percent coefficient of variation (%C.V.) of less than 2.0%. (NOTE: If the %C.V. is 
greater than 2.0%, a problem has occurred in the staining process.) Next, at least 125 
cancer cells are analyzed using the method described in Example IV, and the cell nuclear 
images captured from each 5^m Feulgen stained tissue section. The DNA content 
information and cell nuclear images are saved to a listmode (*.ILM) file. The listmode files 
(*.ILM) are then transferred to a 486 PC equipped with Windows v3.1 and Excel v5.0 for 
Windows, and converted using the JVB ILM Morphometry vl,0 program into 36 
measurements (collectively referred to as JVB Nuclear Morphometric Descriptors, or JVB 
NMD's), which are contained in a Microsoft Access Database file (*.MDB). These 36 
measurements include the sura O.D., size, shape, DNA content, 22 Markovian texture 
features, and nuclear shape features (see Table II). The Microsoft Access Database file 
(*.MDB) is then converted to an ASCII comma delimited file (*.CSV) using a conversion 
feature of the JVB ILM Morphometry vl.O program. Finally, using Excel v5.0, an Excel 
v5.0 macro file is used to convert the ASCII comma delimited file (*.CSV) into separate 
Excel v5.0 files (*.XLS) for each case, each file containing the JVB NMD's for every cell 
captured from that particular specimen. Each Excel v5.0 file ('•'.XLS) also contains the 
means, standard deviations, variances, minima, and maxima for each JVB NMD. In 
addition, the macro creates a summary file containing the above statistics for each JVB NMD 
from every case. 

Example IV: Cancer Cell Selection Method 

The inventors used a cell selection process for the radical prostatectomy specimens 
that seemed to introduce the least amount of bias and took into account the heterogeneity of 
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prostate cancer tumors. The tumor area must first be identified by an expert pathologist. 
Once the tumor area{s) have been identified, a minimum of 25 image fields and a maximum 
of 5-6 cells per image field must then be analyzed and the cell nuclear images captured. The 
cells selected may not be overlapping, and they may not contain any "holes", which is the 

5 result of an improper background threshold setting. Sample the entire circled tumor area. 
The best way to do this is to mentally partition the circled tumor area into four separate 
quadrants, and then measure a minimum of 6-7 image fields per quadrant. In each quadrant, 
select image fields from the "worst" (e.g. Highest grade) cancer areas. (NOTE: The 
"worst" area in each quadrant may vary from low grade, well differentiated cancer to high 

10 grade, poorly differentiated cancer. Just be sure to measure from the "worst" area in each of 
the four quadrants.) Once you have collected the required number of cells, save the DNA 
information and nuclear images to a listmode file. 

Example V: Analysis of CAS-200 DNA Histograms 

15 

The DNA histograms were interpreted and classified by three different methods by 
the consensus of five individuals. 

The three different methods employed cut-offs based upon the results of a DNA 
20 Consensus meeting held at Prautz Neck, ME in 1992 (Shankey, T.V. et al. Cytometry 
14:497-500, 1993). The histograms were interpreted by four different individuals, and a 
consensus DNA ploidy classification agreed upon. The classification methods are as follows: 
(See FIG. 28) 

25 DNA-1 = (0) Diploid; (1) ONR: Hypodiploid; (2) ONR: 

>S-fG2M (11-21%); (3) Abnormal: >S + G2M 
(>21%); (4) Abnormal: Aneuploid; and (5) Abnormal: 
Tetraploid 
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DNA-IO = (0) Diploid and ONR: Hypodiploid; (1) ONR: >S+G2M (11- 
21%); (2) Abnormal: >S+G2M (^21%), Aneuploid, and 
Tetraploid 

DNA-10 = (0) Normal and Oui of Normal Range; (1) Abnormal 

The three different methods employed cut-offs determined by the inventors. The 
histograms were interpreted, and the classification methods are as follows: 

JHHDNA = (0) Diploid; (1) Tetraploid; (2) Aneuploid 

JHHDNAIO = (0) Diploid: (1) Non-Diploid (i.e. Tetraploid and Aneuploid) 

JHH% > 2N = Percentages of S-Phase and Tetraploid fractions combined from 
ploidy determinations, 

' For statistical analysis, each classification scheme coded every subclass as a resuh for 
each patient (i.e. GDI DNA Ploidy: Diploid = 0, Hypodiploid = 1. ONR: >S + G2M = 
2, Tetraploid = 5, Normal = 0, Abnormal = 1; JHH DNA Ploidy: Diploid = 0. 
Tetraploid = 1, Non-Diploid = 1, etc.). The JHH%>2N classification method used the 
percentage as a result for each patient. These coded results were used for statistical analysis. 

Example VI: Nuclear Roundness Factor Measurement and Calculation of Variance 

Definition of Nuclear Roundness Factor 

The nuclear roundness factor represents a dimensionless, size-invariani shape 
descriptor that is based on the ratio of the calculated radius for the measured 
perimeter divided by the calculated radius for the measured area of the nucleus. This 
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descriptor yields a low value of 1.00 for a perfect circle and increases as the shape of 
the nucleus deviates from circularity. In mathematical terms: 

Perimeter (P) = Itt^ and Area (A) = irr/ 

Solve Equations for the Radius 

(rp) = (P)/(27r) and (r.) = >/A/ir 

Substitute Radius Equations into Nuclear Roundness Factor Equation 

Nuclear Roundness Factor (NRF) = Tp/r, = ((P)/(27r)) / (VA/ir ) 

The variance in the nuclear roundness (NRV) was calculated using the following 
formula: 

(NRV) Variance = E(Yj - Y)^ / n 

n = Number of cells measured 
j = The cell 

Yj = The nuclear roundness factor of the j ^ cell 
Y = The average or mean nuclear roundness factor for all of the 
cells 

Measurement of Nuclear Roundness using the DynaCELL^' System 

Histologic tissue sections (5-6 ysti) were cut from re-embedded paraffin blocks 
of radical prostatectomy specimens. Multiple sections were cut (thirty sections per 
specimen), and one set of slides from each specimen were stained with Hematoxylin 
and Eosin (Sakura Diversified Stainer, Model DRS-601) and Feulgen stain (Cell 
Analysis Systems, Elmhurst. IL.)« The H&E staining procedure was performed on 
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sections #1, 10, 20, and 30 for purposes of pathology review to confirm the presence 
of cancer for additional biomarker studies. All pathologic radical prostatectomy 
specimens were assigned Gleason scores (sum). The nuclear roundness factor 
measurements were performed using the H&E sections. A total of 150 cancerous 

' 5 nuclei from the primary tumor were analyzed with a Zeiss inverted IM microscope 

(Carl Zeiss. Inc., Thomwood, NY) equipped with a Zeiss Planochromatic lOOX oil 
emersion objective, giving a total magnification of 2440X. The nuclear images were 
digitized and analyzed with the DynaCELL''"'*^ Motility Morphometry Measurement 
workstation (JAW Associates, Inc.. Annapolis, MD). In this invention, the nuclear 

10 roundness variance measurement is the only calculation used from the DynaCELL™ 

Motility Morphometry Measurement software. 



Example VII: Utilization of Increased Magniflcation (63X) to Reduce the Number 
15 of NMD's Required to Predict Progression 

Using a subset of the original patient sample (10 progressors and 10 non-progressors), 
measurements were conducted as in Examples II & III, except that instead of using the 
normal 40X objective, a 63X objective lens was used. The data and statistics obtained using 

20 the 63X objective were analyzed and compared lo the data and statistics obtained using the 
40X objective. Table III summarizes the results of the statistical analysis using the 40X and 
63X data to predict prostate cancer progression in the subset of 20 patients. Please note that 
the total number of NMD's required to predict an outcome is decreased as the magnification 
increases (Table III; also see FIGS. 3 - 6), as well as significant changes in the actual 

25 individual NMD's utilized in the model. 
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Example VIII: 



Immunochemical Staining for Her-l/neu (C'€rbB2) 



Her-2-neu (c-erbB2) monoclonal antibody (Ab-3, OP- 15) was provided by Oncogene 
Sciences Inc. (Uniondale. NY) as a gift. The Supersensitive MultiLink™ kit (BioGenex 

5 Inc., CA), which employs the strep-avidin biotin complex (ABC) alkaline phosphatase (AP) 
labelling method, was used for monoclonal antibody detection. All staining was performed 
with the MicroProbe™ manual staining system (Fisher Scientific, Pittsburgh, PA) that 
utilizes capillary action vertical staining principles. Incubation for the monoclonal antibody 
was 40 centigrade overnight. Briefly, the staining procedure includes first preparing the 

10 immunostaining reagents as follows: 

Immunostaining Reagent Preparation 



PBS pH 7,6 with 0.1% Triton X-100 



15 



20 



Place 450 ml of Type I HjO into a 500 ml graduated cylinder. Then add one 
envelope of Coulter PBS Buffer Reagent (Coulter Source, Marietta, GA) to the type I 
water while stirring. Adjust the pH to 7.6 with approximately 20 drops of 1 N 
NaOH (a plastic transfer pipet is useful in adding the NaOH). Pipette 500 ^1 of 
Triton X-1(X) to the solution. Next, Adjust the volume of the solution to 500 ml with 
Type I HjO. 



PBS pH 7.6 with 0.5% Triton X-IOO 



25 



Place 450 ml of Type I HjO into a 500 ml graduated cylinder. . Then add one 
envelope of Coulter PBS Buffer Reagent to the type I water while stirring. Adjust the 
pH to 7.6 with approximately 20 drops of 1 N NaOH (a plastic transfer pipet is useful 
in adding the NaOH). Pipette 2.5 ml of Triton X-100 to the solution. Adjust the 



volume of the solution to 500 ml with Type I HjO. 
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IM Levamisole Stock Solution 

Measure 241 mg (0.241 g) of levamisole (Sigma) using an analytic balance. 
Place the levamisole into a 1.5 ml microcentrifuge tube containing 1 ml of Type I 
HjO. Mix the contents with the aid of a vortex mixer. Store the solution at 4 'C 
until it is used. 

5% Nonfat dry milk with PBS pH 7.6 0.1% Triton X-100, 0.05% thimerosal 

Place 5 grams of nonfat dry milk in a Erlenmeyer flask containing 100 ml PBS 
pH 7.6 with 0.1% Triton® X-100. Then, add 0.05 g of thimerosal and mix the 
solution by stirring. Store 5 ml aliquots of the solution at -80* C. Upon thawing, 
the solution should be stored at 4*C. Do not use this solution if it has been stored at 
4'C for longer than 5 days. 

0.5% Nonfat dry milk with PBS pH 7.6 0.1% Triton X-100 

Pipette 100 ^1 5.0% nonfat dry milk with PBS pH 7.6 0,1% Triton X-lOO, 
0.05% thimerosal into a 1.5 ml microcentrifuge tube or 10 ml test tube containing 
900 ^1 of PBS pH 7.6 with 0.1% Triton X-100. Mix the solution with the aid of a 
vortex. The solution should be stored at 4'C. Do not use this solution if it has been 
stored at 4'C for longer than 5 days. 

C-Neu, Her.2/Neu (1:40) 

Pipette 875 /xl of PBS pH 7.6 with 0.1% Triton X-lOO into a 1.5 ml 
microcentrifuge tube or 10 ml test tube. Pipette 100 /zl 5.0% nonfat dr)- milk with 
PBS pH 7.6 with 0.1% Triton X-100 to the tube and mix the solution with the aid of 
a vortex. Then, pipette 25 /xl of C-Neu (ab-3) to the tube and mix with the aid of a 
vortex. The antibody should be added last to the solution. 
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Normal Mouse Serum Control (1:1000) 

Pipette 899 /xl of PBS pH 7.6 with 0.1 % Triton X-100 into a 1.5 ml 
microcentrifuge tube or 10 ml test tube. Pipette 100 /xl 5.0% nonfat dry milk with 
PBS ph 7,6 with 0.1 % Triton X-100 to the tube and mix the solution with the aid of a 
vortex. Pipene 1 fil Normal Mouse Serum (Dako) to the solution and mix with the 
vortex. The normal mouse serum should be added last to the solution. 

Mouse IgGl Isotypic Control (1:200) 

Pipette 895 ^1 of PBS pH 7.6 with 0.1% Triton X-100 into a 1.5 ml 
microcentrifuge tube or 10 ml test tube. Pipette 100 ^i] 5.0% nonfat dr)' milk with 
PBS ph 7.6 with 0.1% Triton X-100 to the tube and mix the solution with the aid of a 
vortex. Pipette 5 fi\ Mouse IgGl Isotypic Control (Coulter) to the solution and mix 
with the vortex. The Mouse IgGl Isotypic Control should be added last to the 
solution. 

2*Ab (Biotinylated anti-mouse IgG, Multilink) 

Comes premixed in the BioGenex Large Volume MultiLink Kit 

Label (Streptavidtn/ Alkaline Phosphatase) 

Comes premixed in the BioGenex Large Volume MultiLink Kit 

Fast Red Chromogen Solution 

Pipette 5 /xl of 1.0 M Levamisole to the 5 ml vial of Naphthol Phosphate in 
Tris Buffer. Add one Fast Red Tablet to the solution and vortex until the tablet is 
completely dissolved. This solution must be used immediately after preparation. 
'''Levamisole is added to the Fast Red Solution to block endogenous alkaline 
phosphatase activity. 
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The Her-2/neu antigenicity was then scored. The scoring method assessed the amount 
of staining area within the •'dotted cancer zone" as either negative (0), focal (1), or diffuse 
(2), and the intensity of the staining was scored as 0-4 + , ranging from negative (0) to strong 
red color (4-f ) resulting from the AP red substrate reaction (see FIG. 28a). 

'5 

Example IX: Immunochemical Staining for PD-41 

The PD-41 (Prostate Mucin Antigen) monoclonal antibody was provided by Dr. 
George Wright at Eastern Virginia Medical School under a materials transfer agreement. 

10 The Supersensitive MultiLink™ kit (BioGenex Inc., CA), which employs the strep-avidin 
biotin complex (ABC) alkaline phosphatase (AP) labelling method, was used for monoclonal 
antibody detection. All staining was performed with the MicroProbe™ manual staining 
system (Fisher Scientific, Pittsburgh. PA) that utilizes capillary action vertical staining 
principles. Incubation for the monoclonal antibody was 37 o centigrade for 15 minutes. 

15 Briefly, the staining procedure includes first preparing the immunostaining reagents as in 
Example X, except with the following changes: 

Immunostaining Reagent Preparation 

20 PD-41 (15 Mg/ml) 

Place 800 m1 of PBS pH 7.6 with 0. 1 % Triton X-100 in a 1 .5 ml 
microcentrifuge tube. Add 100 ^1 5% milk to the tube and mix the contents with the 
aid of a vonex mixer. Then, add 100 m1 of PD-41 to the tube and mix the contents 
with the aid of a vortex mixer. 

25 

CAS Red Chromogen Solution 

Add 900 /xl of Type I 11,0 to a 1.5 ml microcentrifuge tube. Add the 1 ^1 of 
1 M levamisole to the tube. Then, add 100 /il of CAS red substrate concentrate and 
mix the contents with the aid of a vortex mixer. Add 45 /xl of CAS red chromogen 



4 
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concentrate (always add this ingredient last) to the solution and mix the contents with 
the aid of a vortex mixer. This solution must be used immediately after preparation. 



The PD-41 antigenicity was then scored. The scoring method employed the number 
of positive staining ducts divided by the total number of ducts in the "dotted cancerous 
zone". The percentages of positively staining ducts was used as a patient result. 

PD-41 Background 

Monoclonal antibody PD-41, a mouse IgG,^, was first described by Beckett et al. 
(Beckett, ML, Lipford, GB, Haley, CI, Schellhammer, PF and Wright, GL. Monoclonal 
Antibody PD41 Recognizes an Antigen Restricted to Prostate Adenocarcinomas. Cancer Res. 
51:1326-1333, 1991) by its reactivity to an prostate adenocarcinoma-restricted mucoprotein 
known as prostate mucin antigen (PMA). The target PMA, an 0-linked oligosaccharide- 
associaced protein with a molecular weight of >'400 kd in prostate cancer patient seminal 
plasma, has not been demonstrated to recognize mucins at other organ sites. Wright et al. 
(Wright, GL. Beckett, ML et al. Mucins as biomarkers of prostate carcinoma. J. Urol. 
149:450A. 1993) demonstrated immunoperoxidase imjnunoreaciivity of PD-41 with in 100% 
of primar}', 71 % of metastatic carcinomas and under 1% of normal and benign prostatic 
tissues, including BPH. 

An independent study of 95 prostate needle core biopsy paraffin-embedded sections 
showed PD-41 reactivity in ductal epithelia and/or prostatic glandular secretions within 56% 
(53/95) of prostate tumor specimens (Marley, GM, Veltri, RW, Patton, KP and Wright, GL. 
Histochemical Expression of a Unique Prostate Mucin Antigen from Core Biopsies. Proc. 
Amen Assoc. Cancer Res. 34:28, 1993). When Gleason score or DNA ploidy were 
employed as stratification parameters, PD-41 proved to be an independent factor of 
prognostic value. Clinical follow-ups of 61% of this cohort confirmed that PD-41 expression 
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acted as an independent marker of tumor aggressiveness (Veltri, RW et al, , recent CDI 
unpublished data). 

Example X: The Patient Sample 

•5 

A group of one hundred and twenty-four (124) patients with localized prostate cancer 
were used in this study. The sample was optimized for the evaluation of mmor progression. 
The patients had clinically localized prostate cancer and were followed for evidence of 
progression based upon one or more of the following events: a detectable post-operative 

10 prostate specific antigen (PSA) level, local recurrence, evidence of metastasis following 
radical prostatectomy, or death. The patient sample had been clearly defined for pre- 
operative Gleason grades, post-operative Gleason grades, clinical and pathological stage, 
organ disease confinement, focal or established capsular penetration, and surgical margin 
status. None of these patients had any seminal vesicle or lymph node invasion. As used 

15 herein, pre-operative Gleason sore is the highest Gleason score from all positive sextant 

biopsies for a single case. The demographics of the patient sample is illustrated in Table IV. 

Example XI: Statistical Analysis - Logistic Regression 

The logistic regression statistical analysis of the data was performed using the 

20 STATA™ v3.1 (Staia Corporation, College Station, TX) statistical analysis software 

program. This invention applied logistic regression to every independent parameter (e.g. 
NMD's, biomarkers, Gleason scores, etc.) first to select the univariately significant variables 
for progression or organ confmed disease status (Table V & VIII) using the STATA™ 
statistical software package {STATA™ conmiand: logistic). Statistical significance consisted 

25 of p values < 0.05. Next, the univariately significant independent variables were 

multivariately assessed using backwards stepwise logistic regression {ST ATA™ command: 
swlogis) to determine which independent variables (e.g. NMD's (CMP or JVB), Gleason 
Score, and biomarkers) were aggregately significant in the prediction of progression or organ 
confined disease status (Tables VI, Via, VII. Vila, IX, IXa, X, & Xa). The software 
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program generated Receiver Operator Characteristic (ROC) curves with investigator selected 
cutoff, resulting in optimized sensitivity, specificity, positive predictive values, and negative 
predictive values (see above listed tables and FIGS. 7-27). Kaplan-Meier actuary plots 
were also generated for the progression analysis. 

STATA™ also provides a command (logit, an estimated maximum-likelihood logit 
model) that provides the weighted coefficients for the statistically significant independent 
variables used in the multivariate model as well as the model constant. The general formulas 
for calculating the predictive index and predictive probability are as follows: 

Predictive Index {xb) = (iSq + ^iVar(l) + /?2var(2) + + 

^.var(/i)) 

Predictive Probability (p) = e^* / (I 4- e'*) 
Where: 

^0 = Formula Constant; 
j3, through ^„ = Weight factors for variables 1 through n; var(l) through 

var(/?) = Independent variables being used in logistic regression 

model. 

The final calculation of the predictive probability provides a patient-specific value, between 0 
and 1, for the probability of a specific outcome (e.g. progression or organ confined disease 
status). The threshold value (cutoff) for the predicted probability is selected based upon the 
results of the ROC curves. Equation 2 gives an example using the weighted formula. 



-70- 



wo 97/12247 



PCTAJS96/I5780 



Formula for Progression Predicted Probability of JVB 
Morphometry Features Alone 

(-61.2378) + (-0.7827881)(stdevl) + (3.209138)(stdev2) + 
(13.90239)(stdev4) + (.1381.354)(stdev7) + (11.52351)(stdev8) + 
(-L18553)(sidevl0) + (-.0.603529)(stdevll) + (0.6900095)(stdevl4) 
+ (1254.563)(stdevl5) -f (2964.96)(stdevl7) + (2.3701 12)(stdev24) + 
(505.4493)(stdev26) + (-7.731 191)(stdev28) + (84.19147)(stdev31) + 
(-28.88644)(stdev34) + (0,01796)(varl) + (-0.0390615)(var2) + 
(7.529075)(var3) + (^5.39564)(var4) + (12174.49)(var7) + (- 
18176.55)(varl5) -f (-26895. 28)(varl7) + (-2.079085)(var22) + 
(1.093606)(var28) + (2I80.479)(var29) + (1.959982)(var34) -h (- 
7.559859)(var35) 

Predictive Probability: 

Morphometry Pj = exp(Xjb) / {l4-exp(Xjb)), 

where x, = The (row) vector of independent variables of the j"^ observation, (i.e., the 
independent variable value). 
b = The corresponding estimated parameter (column) vector, (i.e., 

the weight factor associated with that particular independent 
variable). 

Pj ~ Predicted probability of a positive outcome for the j'^' observation. 



Equation 2: 

Predictive Index: 
Morphometry xp = 
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Example XII: Application of Neural Networks 

The multilayer feed forward perceptron with error back propagation training method 
is chosen for this work. The back propagation method is a gradient based learning 
S procedure, but has the drawback of local optimum. Studies show sigmoid activation 

function, which is often used with neural network, is not necessarily the optimal choice. It 
has been suggested in certain classes of problems that the use of sinusoidal or gaussian 
activation functions reduce the training time substantially. In this work, both sigmoid and 
sinusoidal activation functions are studied. 

10 

In a multilayer neural network, hidden layers are of particular imponance. How well 
the network approximates the discriminate surface to a large degree depends on the number 
of hidden neurons. Allowing too few or too many parameters to be used in the training will 
lead to under or over fitting. Therefore, efforts have been made to identify the optimal 
IS number of hidden neurons. 

The neural network (NN) software program of the present invention has a single 
hidden layer. Morphometry data from the radical prostatectomy samples was analyzed, and 
a total of 28 NMD's were extracted. Backwards stepwise logistic regression analysis of the 

20 data utilizing the STAT A™ software showed that only 14 of the NMD's were multivariaiely 
significant. The 30 feamre network used all 28 NMD's, post operative Gleason score, as 
well as the perimeter and nuclear roundness variance calculated using the CAS-200). Using 
the data sets of 15, 28, and 30 measurements, two different network types were trained, a 
standard multilayer sigmoidal type with a single hidden layer, and a hybrid network 

25 previously described. Further utilization of the data used for training these networks within 
the scope of the invention will result in networks with greater accuracy. 
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Example XIII: Organ Confinement Model 

The following example describes the procedures for multivariate statistical analysis as 
applied lo the prediction of organ-confined disease status that is based on sextant biopsy 
pathology, PSA and quantitative image analysis. 

The parameters utilized for the model include patient PSA, number of positive sextant 
core biopsies, tumor location (base and/or apex involvement). Sum % area of mmor 
involvement. Gleason Score, DNA ploidy, and quantitative nuclear grade (a combination of 
size, shape, and DNA content descriptors, and Markovian texture as determined using a 
10 CAS-200 Image Analysis System). 

A total of two-hundred and twenty-seven (227) sextant biopsy specimens provided, 
(n=150) including from collaborators at Ohio State University {n=77) that were used to 
prepare the trained algorithm (Table XIV). One hundred and eighteen (118 or 52%) patients 

15 were non-organ confined (NOC) with 13 (6%) of these NOC patients having positive lymph 
nodes and 43 (19%) having seminal vesicle involvement. Five micron sections were 
prepared from paraffin-embedded specimens and placed on Probe-On™ slides, the first 
section was stained from paraffin-embedded specimens and placed on Probe-On™ slides, the 
first section was stained with H&E and the next with the Feulgen stain (B-D Cellular 

20 Imaging Systems, San Jose, CA). The sequential H&E and Feulgen stained slides were 
screened and "dotted" by uropathologists. From the "dotted" areas of the Feulgen stained 
slides, 125 intact cancer cell nuclei were analyzed using the CAS-200 Image Analysis 
Systems (B-D Cellular Imaging Systems, San Jose, CA). Cellsheet'** vl.Od, a new lisimode 
file conversion software package (JVB Imaging, Elmhurst, IL) which calculates 38 NMD's, 

25 including shape, size, and DNA content descriptors, and Markovian texmre features 

(Pressman. 1976) (Table XV), was employed to process the nuclear image data. All data 
were analyzed using STATA™ (Stata Corp., College Station. TX), CART (CART vl.02, 
Salford Systems, San Diego, CA), and the p values were calculated using the non-parametric 
logistic regression analysis method. 
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TABLE XV 

CellSheet ' vl.Od Nuclear Morphometric Descriptors 



1 

X • 


Ohiecf ^11 m Ontical Den^itv 


22 


lyid^uiiai iviurncni 


2. 


Ohiect Size 


23. 


Ow.U-UilU LJiA^yjluXl .Yl\JlIICill 






94. 


rrouuci iviomeni 


4 


Picograms of DNA 


25 


1 lidiiguiar ^ynuuCiry 


5 


Angular Second Moment* 


26. 




6. 


Contrast* 


27. 


Cell Cla^ciflratinn 


7 


r^nrrp 1 a 1 1 r»n * 




(\ = HvnAHinlr^iH — • T^ir^loiri 
V A riy puuipioiu, z — L/ipiOlu , 


Q 
O. 


L/iiicrenL.c xviOiiiciii 




J — o-r nase , d — i eirapioia , 




invciSv L/jiicrcncc xvxuuicni 




o — xiypcrpioiaj 


in 


i3ulll /WCldgC 






1 1 

i 1 • 


i3Ulll V al laiiwc 


28 




12 


Entropy* 


29 


DNA Index 


13. 


30. 


Density 


14. 


Difference Variance* 


31. 


Average Optical Density 


15. 


Difference Entropy* 


32. 


Feret X 


16. 


Information Measure A* 


33. 


Feret Y 


17. 


Information Measure B* 


34. 


Maximum Diameter 


18. 


Maximal Correlation Coefficient* 


35. 


Minimum Diameter 


19. 


Coefficient of Variation* 


36. 


Elongation 


20. 


Peak Transition Probability* 


37. 


Run Length 


21. 


Diagonal Variance* 


38. 


Configurable Run Length 



*NOTE: Values 5-25 are grain dependent Markovian texture features. Grain may be 
looked at as a measurement in pixels of the width of an averaged sized object. 
The grain values for all of these measurements were set to 1 . 



A. Specimen Collection and Processing 

Each patient had their prostate biopsied in six different areas (left apex, left mid, left 
base, right apex, right mid, and right apex) using an 18-gauge "tni-cut" type needle and a 
spring driven biopsy gun. Each biopsy was placed in a properly labeled vial containing 10% 
neutral buffered formalin. The specimens were then processed and embedded in separate 
paraffin blocks using Tissue-Tec processing and embedding stations. Each block is labelled, 
and then multiple 5/xm tissue sections are cut, placed on glass slides, air dried, and heat 
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fixed. The first and third slides (Levels I & III) are H&E stained. The second slide (Level 
II) is Feulgen stained, and the fourth slide (Level IV) is stored. 

Each of the H&E slides was reviewed by a pathologist to determine which of the 
• 5 slides contains tumor cells. The number of positive cores is determined by adding up the 
number of areas which contain tumor (/.e. left base, left mid, left apex, right base, right 
mid, or right apex). 

The pathologist determines from each positive slide a Gleason score, and then 
10 calculates the percent of tumor involvement for that biopsy. This is done by measuring first 
the total length of the. tissue in the 5 ftm section, and then the total length of the tissue which 
is involved in the mmor. The percentage of tumor involvement is then calculated by dividing 
the tumor length by the overall biopsy length. Once these determinations have been done for 
each slide, the total % involvement is calculated by adding up the percentages of tumor 
15 involvement from each positive biopsy. 

The tumor location variable is then determined. If there is tumor involvement in the 
base and/or apex biopsies > 5% total, a value of 1 is entered for the variable. If the 
involvement of tumor in the base and/or apex biopsies is < 5% total, a value of 0 is entered 
20 for the variable. 

The highest Gleason score is determined by simply using the highest Gleason score 
from the positive biopsies for that particular case. The pathologist then determines the 
positive core biopsy that best represents the "worst" area of tumor for the case, and that 
25 biopsy is used for the DNA ploidy and quantitative nuclear grade calculations. 
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B. Collection and Processing of Feulgen Stained Cancer Nuclei and Calculation of 
Quantitative Nuclear Grade 

The Feulgen stained cancer cell nuclei from the S/xm sextant biopsy specimens are 
analyzed and the images capoired using the QDA v3.0 software program on a CAS-200 
Image Analysis System. The optical system is calibrated using the CAS calibration slides 
that were stained with the specimen slides by measuring at least 20 calibration cells, with a 
resulting calibration peak percent coefficient of variation (%CV) of less than 2.0%. Next, 
using the biopsy which is the most representative of the cancer (selected by the pathologist), 
a minimum of 125 cancer cell nuclei are analyzed using the method described below at the 
specified magnification. The QDA v3.0 software program generates a DNA histogram and 
saves the nuclear images as well as the DNA content information to a listmode file (*.ILM). 
The listmode file is then uploaded into a network system using a Cortex program, and the 
nuclear images are converted into 38 different nuclear morphometric descriptors (NMD's) (8 
nuclear shape descriptors, 8 DNA content descriptors, and 22 Markovian texmre descriptors) 
using a customized version of the Cell Sheet vLOd program. A C-language program is then 
used to calculate the variance of each of the 38 different NMD's for the population of cells 
measured from a particular case (i.e. 125 cells). The variances of 22 of the 38 NMD's are 
then used to calculate a Quantitative Nuclear Grade (QNG) for the case, using a formula 
derived from a logistic regression model (Table XVI). 
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10 



N227QNG PI = 



TABLE XVI 
Quantitative Nuclear Grade Formula 

(1.519492) + (-1.167831)(Var3) + (-24.77048)(Var4) + 
(102580)(Var5) + (-0.0010737)(Var6) + (1667.514)(Var7) + 
(1.33661)(Var8) + (-662.9786)(Var9) + (0.00000248)(Varll) + 
(-45668.71(Yarl2) + (0.0019424)(Varl4) + (-2541.086)(Varl7) + 
(I154.989)(Varl9) + (.6269.475)(Var20) + (14700000)CVar21) + 
(-1751.078)(Var26) + (1.042041) (Var 27) + (0.2199372)(Var28) + 
(1276.338)(Var29) + (.1.377949)(Var34) + (-1.394814)(Var35) + 
(22.47551)(Var36) + (-16.41575)(Var38) 



N227QNG = exp(N227Txt PI) / (1 + exp(N227Txt PI)) 



15 



20 



25 



30 



Var3 
Var4 
Var5 
Var6 
Var7 
Var8 
Var9 
Varll 
Varl2 
Varl4 
Var 17 
Var 19 
Var 20 
Var 21 
Var26 
Var27 
Var28 
Var29 



Population Variance (n-1) of Object Shape 
Population Variance (n-1) of Pg. DNA 

Population Variance (n-1) of Angular Second Moment (Pixel Step Size = 1) 
Population Variance (n-1) of Contrast (Pixel Step Size = 1) 
Population Variance (n-1) of Correlation (Pixel Step Size = 1) 
Population Variance (n-1) of Difference Moment (Pixel Step Size = 1) 
Population Variance (n-1) of Inverse Difference Moment (Pixel Step Size = 1) 
Population Variance (n-1) of Sum Variance (Pixel Step Size =1) 
Population Variance (n-1) of Sum Entropy (Pixel Step Size =1) 
Population Variance (n-1) of Difference Variance (Pixel Step Size = 1) 
Population Variance (n-1) of Information Measure B (Pixel Step Size « i) 
Population Variance (n-1) of Coefficieiit of Variation (Pixel Step Size = 1) 
Population Variance (n-I) of Peak Transition Probability (Pixel Step Size = 1) 
Population Variance (n-1) of Diagonal Variance (Pixel Step Size = 1) 
Population Variance (n-1) of Standard Deviation 
Population Variance (n-1) of Cell Classification 
Population Variance (n-1) of Perimeter 
Population Variance (n-1) of DNA Index 
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Var34: Population Variance (n-1) of Maximum Diameter 
Var35: Population Variance (n-1) of Minimum Diameter 
Var36: Population Variance (n-1) of Elongation 

Var38 Population Variance (n-1) of Configurable Run Length (Pixel Sample Size = 1, 
Pixel Difference Threshold = 0.1, Projection Configuration: All four 
possibilities, Normalized to Cell Area) 

5 

C. Cancer Cell Selection Method 

The biopsy to be analyzed for DNA and QNG is selected by an expert pathologist. 
The biopsy chosen is the one which the pathologist thinks is the most representative of the 
cancer present in the sextant biopsy specimen. The cancer area on the biopsy is identified 

10 and marked by the pathologist. Once the nimor area(s) have been identified, a minimum of 
125 Feulgen stained nuclei representing the "worst" cells present are analyzed and the images 
captured. The cells selected may not be overlapping, and they may not contain any "holes", 
which is the result of an improper background threshold setting in the QDA software. The 
"worst" cells are the highest grade cancer cells present in the tumor area(s), which may vary 

15 from low grade, well differentiated cancer to high grade, poorly differentiated cancer. Once 
the required number of cells are collected, the DNA content information and nuclear images 
are saved to a listmode file. 

D. Analysis of CAS-200 DNA Histograms 

20 Three different methods employing cut-offs based upon the results of a DNA 

consensus meeting held a Prautz Neck. ME in 1992 (Shankey TV, aL Cytometry 14:397- 
500, 1992) were used to interpret and classify the DNA histograms. The histograms were 
interpreted by the pathologists, and a DNA ploidy classification assigned to the case. The 
classification methods are as follows: 

25 

DNAl: (0) Diploid; (I) Out of Normal Range (ONR: HypodipIoid,m (2) ONR: 
>S+G2M 111-21%]; (3) Abnormal: >S-l-G2M L>21%]; (4) 
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Abnonnal: Aneuploid [>.10% cells in aneuploid peak]; and (5) 
Abnonnal: Tetraploid L> 16% cells in tetraploid peak] 

DNAIO: (0) Diploid; (1) ONR: Hypodiploid and ONR: >S+G2M [11-21%]; 
(2) Abnonnal: >S+G2M L>21%], Aneuploid L>10% cells in 
aneuploid peak], and Tetraploid L> 16% cells in tetraploid peak] 

DNAIO: (0) Diploid and ONR, (2) Abnonnal 

E. Pathological Assessment of Sextant Core Biopsies 

An expert pathologist reviews the H&E and Feulgen stained 5/iin sections and 
identifies the biopsies which contain cancer for each case, as well as determining the Gleason 
score and the % linear involvement of the tumor for every positive sextant biopsy involved in 
a single case. The pre-biopsy PSA value for each case is provided by the physician, when 
available. 

F, Quantitative Nuclear Grade 

The Quantitative Nuclear Grade (QNG) utilizes quantitative micro-spectrophotometric 
image analysis with a CAS-200 image analysis system (Becton-Dickinson-Cell Analysis 
Systems, San Jose. CA) to capture uropathologist selected cancer cell nuclear images stated 
by the Feulgen method. A unique software package (CellSheet^, Elmhurst, IL) is used to 
transform listmode data files into nuclear features which mathematically characterize shape, 
size» DNA content, and chromatin complexity. There are 38 nuclear descriptors, and they 
include eight DNA content descriptors, eight size and shape descriptors, and twenty-two 
Markovian texture (chromatin complexity) descriptors. The inventors have studied the 
application of this software closely with the developer and determined important features that 
are useful for urologic malignancies, and in particular prostate cancer. The present studies 
required testing several hundred pathologically defined specimens with well characterized 
clinical follow-up in order to assess the conelation of the QNG variables to specific 
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outcomes such as progression and pathological stage (non-organ-confmed disease status). 
The Quantitative Nuclear Grade (QNG) is calculated using non-parametric logistic regression 
statistical methods that determine the univariately and multivariately significant subsei of the 
38 available Cell Sheet"* descriptors that are useful for the prediction of the outcome of 
interest. The QNG technology has been applied to both sextant core biopsies and 
pathologically diagnostic radical prostatectomy blocks successfully. 

G. Statistical Analysis of N227SXNT Database 

CART analysis (CART vl.02, Salford Systems, San Diego, CA) was used to 
determine optimal cutoff (i,e. maximum sensitivity) for Pre-Op PSA by limiting the tree 
growth to two result nodes (FIG. 30). Pre-Op PSA is the pre-sextant biopsy PSA value, and 
the optimal cutoff value was found to be 16. 3 ng/ml. A binary variable was created for 
the PSA values; 0 = PSA <_16.3 ng/mU 1 = PSA > 16.3 ng/ml. Next, the variances of all 
38 NMD's obtained from the CellSheet™ vl.Od Beta4 program {JWB Imaging, Elmhurst. IL) 
were multivariately assessed for their significance in the prediction of organ confinement 
using backwards stepwise logistic regression (STATA^, College Station, Texas). The 
variances of twenty-two NMD's were retained in a multivariate model. The model constant 
and weighting factors for the twenty-two multivariately significant NMD's were obtained by 
estimating the maximum-likelihood logit model, and the predictive probabilities were then 
calculated and used as the QNG variable (FIG. 31). CART was tried to determine an 
optimal cutoff, but the use of the continuous QNG variable proved to be the most 
univariately significant in the prediction of organ confinement (FIG. 32). The formulas 
utilized in determining organ confinement are set forth in Table XVII. 



TABLE XVII 



Formulas for Determining Organ Confinement 



PosCores: 

HiPreGI: 

Totlnv: 



Number of cores positive for cancer. 

Highest Gleason score from all positive cores for a single case. 
Sum of the % Involvement from all positive cores for a single case. 
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DNAIO: DNA Ploidy from single diagnostic core (Diploid = 0, ONR = 1 , or 
Abnormal = 2) 

BsApxlO-S: Involvement in the Base or Apex cores ^5%. S% Involvement - 1, 
< 5% Involvement = 0) 
5 PrePSAl6: Pre-Biopsy PSA Score 16.3 ng/ml = 1. < 16.3 ng/ml = 0) 

N227QNG: Quantitative Nuclear Grade - Predictive Probability calculated using Variance 
of 22 Cell Sheet vl.Od Beta2 feanires 

Formula for cases mth PSA Values: 

I'O 

(SxtPSA-PI) 

Predictive Index = (1.978019) + (0.1267477)(PosCores) + (-0.5226136)(HiPreGI) + 
(-0.0064845)(Todnv) + (0.2130566)(DNAIO) + 
(-0.7773022)(BsAPX10-5) + (-2.203646)(PrePSA16) + 
15 (5.08648)(N227QNG) 

Predictive Probability = exp(SxtPSA-PI) / (1 +exp(SxtPSA-PI)) 

Organ Confinement Prediction: 0 = Predicted to be Non-Organ Confined (any Predictive 
20 Probability S 0.500) 

1 = Predicted to be Organ Confined (any Predictive 
Probability > 0.500) 

25 Formula for cases without PSA Values: 

(SxtNoPSA-PI) 

Predictive Index = (1.026344) + (0.1539648)(PosCores) + (-0.4467591)(HiPreGI) + 
(-0.007922)(Toanv) + (0.31 3745 1)(DNAI0) -t- 
30 (.06676012)(BsApxl0.5) -»- (5.075 14)(N227QNG) 

Predictive Probability = exp(SxtNoPSA-PI) / (1 +exp(SxtNoPSA-PI)) 

Organ Confinement Prediction: 0 = Predicted to be Non-Organ Confined (any Predictive 
35 Probability < 0.500) 

1 = Predicted to be Organ Confined (any Predictive 
Probability > 0.500) 



40 A number of tumor location (i.e.. Base, Apex, Mid) variables were analyzed using 

logistic regression to determine their univariate significance in the prediction of organ 
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confinemeni. The most significant univariate variable was determined to be involvement of 
^5% in the base and/or apex biopsies. A number of Gleason scoring variables (i.e. Age, 
DNA ploidy. Quantitative Nuclear Grade, Number of Positive Cores, Sum Total % 
Involvement, ect.) were analyzed using logistic regression to determine their univariate 
5 significance in the prediction of organ confinement. Note: DNA ploidy (DNA 10) was 
retained in the multivariate model even though it was not univariately significant. The 
univariately significant variables (i.e. Number of Positive Cores, Total % Involvement, 
Highest Gleason Score, PSA at a cutoff of <,16.3 ng/ml. Base and/or" Apex Involvement 
>.5%, and the Quantitative Nuclear Grade) and DNA ploidy (DNAIO) were muliivariately 
10 assessed for their significance in predicting organ confinement using logistic regression. 

Results . The univariately significant independent variables included # of positive 
cores (p < 0.00001), biopsy Gleason score (p < 0.00001), quantitative nuclear grade (p < 
0.00001), sum % tumor involvement (p < 0.00001), pre-operative PSA at 16.3 ng/mL 

15 cutoff (p < 0.00001), and mmor location (base and/or apex involvement) (p = 0.00008) 
(Table XVIII). Using a multivariate backward stepwise variant of logistic regression, the 
above independent variables, including DNA ploidy, were retained in a model to identify the 
non-organ confined prostate cancer cases. The complete model (n = 217) had a sensitivity 
- 81.1%, specificity = 77.5%, positive predictive value (PPV) = 77.5%, negative 

20 predictive value (NPV) = 81.1%, and the area under the Receiver Operator Curve (AUC) 
was 0.8555 (FIG. 34). When PSA values were dropped from the algorithm, the model (n = 
227) had a sensitivity of 75.2%, specificity of 73.7%, PPV of 72.6%, NPC of 76.3%, and 
an AUC of 0.8118 (FIG. 33). In FIG. 35, illusurated is quantitative nuclear grade alone as a 
much greater predictor of organ confined disease status that the highest pre-op Gleason score. 

25 While QNG has an AUC of 0.7292, the highest pre-op Gleason score only reached an AUC 
of 0.6460 (Table XVIII). 

To identify a patient as "OC" or •'NOC, the threshold value of 0.5 was used. If the 
output was greater than 0.5, it was considered "1", otherwise it was classified as "O**. Thus. 
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a "r is considered "0C'\ and a "0" is considered "NOC\ As seen in FIG. 36, the addition 
of PSA to the system yielded true positives for organ confinement of 86, versus false 
positives for non organ confinement of 25. True non organ confined negatives were 86, 
while false negatives for organ confinement were 20 (n=217). 



-85- 



wo 97/12247 



PCT/US96/15780 



CO 
CO 

n 



•a 

C - 
d 

c 
o 
U 

c 

CQ 
Ql) 
U 

o 

I 

C 

o 
Z 

E 



o 



-a 



O 

U 
c 

CO 



D. 

CO 



< 

< 
> 

tu 
Q 
2: 

u 
Q 
2 



5 



♦ 

o 

00 

c 
o 

c/> 

C9 
O 

o 



V) 

■ JC 
DO 



O 

c 

< 

Z 
O 

"2 

'B 

5 

>% 
•o 

*o 

< 
Q 



00 



O 



o 



C7N 
00 



00 



VO 

li 

V 

< 

CO 

o. 

o 



II 

A 

X 

a> 
< 

c 

O 

o 
E 
5 



c 

cc 
3 

a 



T3 
O 



c 

o 



DC 



*o 

rsi 
o 
o 



-86. 



wo 97/12247 



PCT/US96/15780 



In summary, the instant methodology has the ability to predict non-organ confmed 
disease status using a combination of laboratory and pathology data obtained from serum and 
sextant biopsy specimens collected at diagnosis. The method significantly enhances the 
ability of a practitioner to maximize the cimiulative contribution of the data collected to the 
benefit of patient. 

EXAMPLE XIV, Use of RT-PCR in Prediction of Organ Confinement 

A additional predictive parameter that may be used in the organ confmement 
procedure is RT-PCR. Reverse transcription (RT) of RNA to cDNA followed by relative 
quantitative PGR may be used to determine the relative concentrations of specific mRNA 
species isolated from normal, benign and cancerous prostate tissues. By determining that the 
concentration of a specific RNA species varies, it is shown that the gene encoding the 
specific mRNA species is differentially expressed. This technique may be used to confirm 
that mRNA transcripts shown to be differentially regulated are differentially expressed in 
organs that have lost confmement versus those in which the disease remains confined. As 
such, the RT-PCR test will be incorporated as an additional parameter in the instant 
invention, applied to both pre- and post-treatment monitoring. 

In PCR, the number of molecules of the amplified target DNA increase by a factor 
approaching two with every cycle of the reaction until some reagent becomes limiting. 
Thereafter, the rate of amplification becomes increasingly diminished until there is not an 
increase in the amplified target between cycles. If one plots a graph on which the cycle 
number is on the X axis and the log of the concentration of the amplified target DNA is on 
the Y axis, one observes that a curved line of characteristic shape is formed by connecting 
the plotted points. Beginning with die first cycle, the slope of the line is positive and 
constant. This is said to be the linear portion of the curve. After some reagent becomes 
limiting, the slope of the line begins to decrease and eventually becomes zero. At this point 
the concentration of the amplified target DNA becomes asymptotic to some fixed value. This 
is said to be the plateau portion of the curve. 
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The concentration of the target DNA in the linear ponion of the PCR is directly 
proportional to the starting concentration of the target before the PCR was begun. By 
determining the concentration of the PCR products of the target DNA in PCR reactions that 
have completed the same nimiber of cycles and are in their linear ranges, it is possible to 
5 determine the relative concentrations of the specific target sequence in the original DNA 
mixmre. If the DNA mixtures are cDNAs synthesized from RNAs isolated from different 
tissues or cells, the relative abundances of the specific mRNA from which the target 
sequence was derived can be determined for the respective tissues or cells. This direct 
proportionality between the concentration of the PCR products and the relative mRNA 
10 abundances is only true in the linear range portion of the PCR reaction. 

The final concentration of the target DNA in the plateau portion of the curve is 
determined by the availability of reagents in the reaction mix and is independent the original 
concentration of target DNA. Therefore, the first condition that must be met before the 
15 relative abundances of a mRNA species can be determined by RT-PCR for a collection of 
RNA populations is that the concentrationp of the amplified PCR products must be sampled 
when the PCR reactions are in the linear portion of their curves. 

The second condition that must be met for an RT-PCR study to successfully determine 
20 the relative abundances of a particular mRNA species is that relative concentrations of the 
amplifiable cDNAs must be nonnalized to some independent standard. The goal of an RT- 
PCR study is to determine the abundance of a particular mRNA species relative to the 
average abundance of all mRNA species in the sample. 

2S Most protocols for competitive PCR utilize internal PCR internal standards that are 

approximately as abundant as the target. These strategies are effective if the products of the 
PCR amplifications are sampled during their linear phases. If the products are sampled when 
die reactions are approaching the plateau phase, then the less abundant product becomes 
relatively over represented. Comparisons of relative abundances made for many different 
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RNA samples, such as is the case when examining RNA samples for differential expression, 
become distorted in such a way as lo make differences in relative abundances of RNAs 
appear less than they actually are. This is not a significant problem if the internal standard is 
much more abundant than the target. If the internal standard is more abundant than the 
5 target, then direct linear comparisons can be made between RNA samples. 

The discussion above describes the theoretical considerations for an RT-PCR assay 
for clinically derived materials. The problems inherent in clinical samples are that they are of 
variable quantity (making normalization problematic), and that they are of variable quality 

10 (necessitating the co-amplification of a reliable internal control, preferably of larger size 
than the target). Both of these problems are overcome if the RT-PCR is performed as a 
relative quantitative RT-PCR with an internal standard in which the internal standard is an 
amplifiable cDNA fragment that is larger than the target cDNA fragment and in which the 
abundance of the mRNA encoding the internal standard is roughly 5-100 fold higher than the 

15 mRNA encoding the target. This assay measures relative abundance, not absolute abundance 
of the respective mRNA species. 

The number of PCR cycles that are optimal for sampling must be empirically 
determined for each target cDNA fragment. In addition, the reverse transcriptase products of 

20 each RNA population isolated from the various tissue samples must be carefully normalized 
for equal concentrations of amplifiable cDNAs. This is very important since this assay 
measures absolute mRNA abundance. Absolute mRNA abundance can be used as a measure 
of differential gene expression only in normalized samples. While empirical determination of 
the linear range of the amplification curve and normalization of cDNA preparations are 

25 tedious and time consuming processes, the resulting RT-PCR assays can be superior to those 
derived from the relative quantitative RT-PCR with an internal standard. 

The presence of amplification products corresponding to prostate cancer-marker 
nucleic acids can be detected by several alternative means. In one embodiment, the 
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amplification product can be detected by gel electrophoresis and ethidium bromide staining. 
Alternatively, following the gel electrophoresis step the amplification product can be detected 
by standard Southern blotting techniques, using an hybridization probe selected to bind 
specifically to a prostate cancer-marker nucleic acid sequence. Probe hybridization may in 
turn be detected by a standard labelling means, for example, by incorporation of [^^P] 
nucleotides followed by autoradiography. The amplification products may alternatively be 
detected using a solid phase detection system as described above, utilizing a prostate cancer- 
marker specific hybridization probe and an appropriate labelling means. The presence of 
prostate cancer-marker nucleic acids in blood or lymph node samples can be taken as 
indicative of a patient with metastatic prostate cancer. 

One embodiment of the instant invention comprises a method for identification of 
prostate cancer cells in a biological sample by amplifying and detecting nucleic acids 
corresponding to prostate cancer cell markers. The biological sample can be any tissue or 
fluid in which prostate cancer cells might be present. Various embodiments include bone 
marrow aspirate, bone marrow biopsy, lymph node aspirate, lymph node biopsy, spleen 
tissue, fine needle aspirate, skin biopsy or organ tissue biopsy. Other embodiments include 
samples where the body fluid is peripheral blood, lymph fluid, ascites, serous fluid, pleural 
effusion, sputum, cerebrospinal fluid, lacrimal fluid, stool or urine. 

Nucleic acid used as a template for amplification is isolated from cells contained in 
the biological sample, according to standard methodologies (Sambrook et al., 1989). The 
nucleic acid may be genomic DNA or fractionated or whole cell RNA. Where RNA is used, 
it may be desired to convert the RNA to a complementary cDNA. In one embodiment, the 
RNA is whole cell RNA and is used directly as the template for amplification. 

Pairs of primers that selectively hybridize to nucleic acids corresponding to prostate 
cancer-specific markers are contacted with the isolated nucleic acid under conditions that 
permit selective hybridization. Once hybridized, the nucleic acid; primer complex is 
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contacted with one or more enzymes that facilitate template-dependent nucleic acid synthesis. 
Multiple rounds of amplification, also referred to an "cycles/ are conducted until a sufficient 
amount of amplification product is produced. 

Next, the amplification product is delected. In certain applications, the detection may 
be performed by visual means. Alternatively, the detection may involve indirect 
identification of the product via chemiluminescence, radioactive scintigraphy of incorporated 
radiolabel or fluorescent label or even via a system using electrical or thermal impulse 
signals (Affymax technology; Bellus, 1994). 

Following detection, one may compare the results seen in a given patient with a 
statistically significant reference group of normal patients and prostate cancer patients. In 
this way, it is possible to correlate the amount of marker detected with various clinical states. 



15 7. Primers 

The term primer, as defmed herein, is meant to encompass any nucleic acid that is 
capable of priming the synthesis of a nascent nucleic acid in a template-dependent process. 
Typically* primers are oligonucleotides from ten to twenty base pairs in length, but longer 
sequences can be employed. Primers may be provided in double-stranded or single-stranded 

20 form although the single-stranded form is preferred. 

2. Template Dependent Amplification Methods 

A number of template dependent processes are available to amplify the marker 
sequences present in a given template sample. One of the best known amplification methods 
25 is the polymerase chain reaction (referred to as PCR) which is described in detail in U.S. 

Patent Nos. 4,683,195, 4,683,202 and 4,800,159, and in Innis et ai, 1990, each of which is 
incorporated herein by reference in its entirety. 
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Briefly, in PCR, two primer sequences are prepared which are complementary to 
regions on opposite complementary strands of the marker sequence. An excess of 
deoxynucleoside triphosphates are added to a reaction mixture along with a DNA 
polymerase, e.g.. Tag polymerase. If the marker sequence is present in a sample, the 
primers will bind to the marker and the polymerase will cause the primers to be extended 
along the marker sequence by adding on nucleotides. By raising and lowering the 
temperature of the reaction mixture, the extended primers will dissociate from the marker to 
form reaction products, excess primers will bind to the marker and to the reaction products 
and the process is repeated. 

A reverse transcriptase PCR amplification procedure may be performed in order to 
quantify the amount of mRNA amplified. Methods of reverse transcribing RNA into cDNA 
are well known and described ib Sambrook er aL, 1989. Alternative methods for reverse 
transcription utilize thermostable DNA polymerases. These methods are described in WO 
90/07641 filed December 21, 1990; Polymerase chain reaction methodologies are well 
known in the art. 

Another method for amplification is the ligase chain reaction ("LCR"), disclosed in 
EPA No. 320 308, incorporated herein by reference in its entirely, in LCR, two 
complementary probe pairs are prepared, and in the presence of the target sequence, each 
pair will bind lo opposite complementary strands of the target such thai they abut. In the 
presence of a ligase, the two probe pairs will link to form a single unit. By temperature 
cycling, as in PCR, bound ligated units dissociate from the target and then serve as "target 
sequences" for ligation of excess probe pairs. u.S. Patent 4,883,750 describes a method 
similar lo LCR for binding probe pairs to a target sequence. 

Qbeta Replicase, described in PCX Application No. PCT/US87/00880, may also be 
used as still another amplification method in the present invention. In this method, a 
replicative sequence of RNA which has a region complementary to that of a target is added 
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to a sample in the presence of an RNA polymerase. The polymerase will copy tlie 
replicative sequence which can then be detected. 

An isothermal amplification method, in which restriction endonucleases and ligases 
5 are used to achieve the amplification of target molecules that contain nucleotide 5'-[alpha- 
thio] -triphosphates in one strand of restriction site may also be useful in the amplification of 
nucleic acids in the present invention. Walker et al,, Proc, NaiL Acad, Sci. USA 89:392-396 
(1992), incorporated herein by reference in its entirety » 

10 Strand Displacement Amplification (SDA) is anodier method of carrying out 

isothermal amplification of nucleic acids which involves multiple rounds of strand 
displacement and synthesis, i.e., nick translation. A similar method, called Repair Chain 
Reaction (RCR), involves annealing several probes throughout a region targeted for 
amplification, followed by a repair reaction in which only two of the four bases are present. 

15 The other two bases can be added a biotinylated derivatives for easy detection. A similar 
approach is used in SDA. Target specific sequences can also be detected using a cyclic 
probe reaction (CPR). In CPR, a probe having 3 ' and 5' sequences of non-specific DNA 
and a middle sequence of specific RNA is hybridized to DNA which is present in a sample. 
Upon hybridization, the reaction is treated with RNase H, and the products of the probe 

20 identified as distinctive products which are released after digestion. The original template is 
annealed to another cycling prove and the reaction is repeated. 

Still another amplification methods described in GB Application No. 2 202 328, and 
in PCT Application No. PCT/US89/01025, each of which is incorporated herein by reference 
25 in its entirety, may be used in accordance with the present invention. In the former 

application, "modified" primers are used in a PCR like, template and enzyme dependent 
synthesis. The primers may be modified by labelling with a capture moiety (e.g., biotin) 
and/or a detector moiety (e.g., enzyme). In the latter application, an excess of labeled 
probes are added to a sample. In the presence of the target sequence, the probe binds and is 
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cleaved caulytically. After cleavage, the target sequence is released intact to be bound by 
excess probe. Cleavage of the labelled probe signals the presence of the target sequence. 

Other nucleic acid amplification procedures include transcription-based amplification 
systems (TAS). including nucleic acid sequence based amplification (NASBA) and 3SR. 
Kwoh et qL, Proc. Natl Acad. ScL USA 86:1173 (1989); Gingeras et aL, PCT Application 
WO 88/10315, incorporated herein by reference in their entirety. In NASBA, the nucleic 
acids can be prepared for amplification by standard phenol/chloroform extraction, heat 
denaturation of a clinical sample » treatment with lysis buffer and minispin columns for 
isolation of DNA and RNA or guanidinium chloride extraction of RNA. These amplification 
techniques involve annealing a primer which has target specific sequences. Following 
polymerization, DNA/RNA hybrids arc digested with RNase H while double stranded DNA 
molecules are heat denamred again. In either case the single stranded DNA is made fully 
double stranded by addition of second target specific primer, followed by polymerization. 
The double-stranded DNA is made fully double stranded by addition of second target specific 
primer, followed by polymerization. The double-stranded DNA molecules are then multiply 
transcribed by a polymerase such as T7 or SP6. In an isothermal cyclic reaction, the RNA's 
are reverse transcribed into double stranded DNA, and transcribed once against with a 
polymerase such as T7 or SP6. The resulting products, whether truncated or complete, 
indicate target specific sequences. 

Davey et ai, EPA No. 329 822 (incorporated herein by reference in its entirety) 
disclose a nucleic acid amplification process involving cyclically synthesizing single-stranded 
RNA ("ssRNA"), ssDNA, and double-stranded DNA (dsDNA), which may be used in 
accordance with the present invention. The ssRNA is a first template for a first primer 
oligonucleotide, which is elongated by reverse transcriptase (RNA-dependem DNA 
polymerase). The RNA is then removed from the resulting DNArRNA duplex by the action 
of ribonuclease H (RNase H, an RNase specific for RNA in duplex with either DNA or 
RNA). The resultant ssDNA is a second template for a second primer, which also includes 
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the sequences of an RNA polymerase promoter (exemplified by T7 RNA polymerase) 5' to 
its homology to the template. This primer is then extended by DNA polymerase 
(exemplified by the large "Klenow" fragment of E, coli DNA polymerase I), resulting in a 
double-stranded DNA ("dsDNA") molecule, having a sequence identical to that of the 
original RNA between the primers and having additionally, at one end, a promoter sequence. 
This promoter sequence can be used by the appropriate RNA polymerase to make many 
RNA copies of the DNA, These copies can then re-enter the cycle leading to very swift 
amplification. With proper choice of enzymes, this amplification can be done isothermally 
without addition of enzymes at each starting sequence can be chosen to be in the form of 
either DNA or RNA. 

MijJer et aL , PCT Application WO 89/06700 (incorporated herein by reference in its 
entiret)') disclose a nucleic acid sequence amplification scheme based on the hybridization of 
a promoter/primer sequence to a target single-stranded DNA ("ssDNA") followed by 
transcription of many RNA copies of the sequence. This scheme is not cyclic, i.e., new 
templates are not produced from the resultant RNA transcripts. Other amplification methods 
include "race" and "one-sided PCR." Frohman, M.A„ In: PCR PROTOCOLS: A GUIDE 
TO METHODS AND APPUCATJONS, Academic Press, N. Y. (1990) and Ohara et aL , 
Proc. natL Acad. ScL USA, 86:5673-5677 (1989), each herein incorporated by reference in 
their entirety. 

Methods based on ligation of two (or more) oligonucleotides in the presence of 
nucleic acid having the sequence of the resulting "dinoligonucleotide," thereby amplifying the 
di-oligonucleotide, may also be used in the amplification step of the present invention. Wu 
et aL , 

Genomics 4:560 (1989), incorporated herein by reference in its entirety. 
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J. Separation Methods 

Following amplification, it may be desirable to separate the amplification product 
from the template and the excess primer for the purpose of determining whether specific 
amplification has occurred. In one embodiment, amplification products are separated by 
agarose, agarose-acrylamide or polyacrylamide gel electrophoresis using standard methods. 
See Sambrook et a/., 1989. 

Alternatively, chromatographic techniques may be employed to effect separation. 
There are many kinds of chromatography which may be used in the present invention: 
adsorption, partition, ion-exchange and molecular sieve, and many specialized techniques for 
using them including column, paper, thin-layer and gas chromatography (Freifelder. 1982). 

4, Identification Methods 

Amplification products must be visualized in order to confirm amplification of the 
marker sequences. One typical visualization method involves staining of a gel with ethidium 
bromide and visualization under UV light. Alternatively, if the amplification products can 
then be exposed to x-ray film or visualized under the appropriate stimulating spectra, 
following separation. 

In one embodiment, visualization is achieved indirectly. Following separation of 
amplification products, a labeled, nucleic acid probe is brought into contact with the 
amplified marker sequence. The probe preferably is conjugated to a chiomophore but may 
be radiolabeled. In another embodiment, the probe is conjugated to a binding partner, such 
as an antibody or biotin, where the other member of the binding pair carries a detectable 
moiety. 

In one embodiment, detection is by Southern blotting and hybridization with a labeled 
probe. The techniques involved in Southern blotting are well known to those of skill in the 
art and can be found in many standard books on molecular protocols. See Sambrook et al. , 
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1989. Briefly, amplification products are separated by gel electrophoresis. The gel is then 
contacted with a membrane, such as nitrocellulose, permitting transfer of the nucleic and 
non-covalent binding. Subsequently, the membrane is incubated with a chromophore- 
conjugated probe that is capable of hybridizing with a target amplification product. 
5 Detection is by exposure of the membrane to x-ray film or ion-emitting detection devices. 

One example of the foregoing is described in U.S. Patent No. 5,279,721, 
incorporated by reference herein, which discloses an apparanis and method for the automated 
electrophoresis and transfer of nucleic acids. The apparatus permits electrophoresis and 
10 blotting without external manipulation of the gel and is ideally suited to carrying out methods 
according to the present invention. 

The basic technique of differential display has been described in detail (Liang and 
Pardee, 1992). Total cell RNA is primed for fu^t strand reverse transcription with an 

15 anchoring primer composed of oligo dT. The oligo dT primer is extended using a reverse 
transcriptase, for example, Moloney Murine Leukemia Vims (MMLV) reverse transcriptase. 
The synthesis of the second strand is primed with an arbitrarily chosen oligonucleotide, using 
reduced stringency conditions. Once the double-stranded cDNA has been synthesized, 
amplification proceeds by standard PGR techniques, utilizing the same primers. The 

20 resulting DNA fingerprint is analyzed by gel electrophoresis and ethidium bromide staining 
or autoradiography. A side by side comparison of fingerprints obtained from tumor versus 
nonnal tissue samples using the same oligonucleotide primers identifies mRNAs that are 
differentially expressed. 

25 RNA fingerprinting technology has been demonstrated as being effective in identifying 

genes that are differentially expressed in cancer (Liang et al, 1992; Wong et aL, 1992: 
Sager et aL, 1993: Mok et al., 1994; Watson e( aL, 1994; Chen et at.. 1995; An et aL, 
1995). The present invention utilizes the RNA fingerprinting technique to identify genes that 
are differentially expressed in human prostate cancer. 



-97- 



PCT/US96/15780 



RT-PCR Protocols for Conrirmation of Differential Expression of niRNA 

A. Reverse transcription 

Five Mg of total cell RNA from each tissue sample was reverse transcribed into 
cDNA. Reverse transcription was performed with 400 units of MMLV reverse transcriptase 
(GIBCO/BRL) in the presence of 50 niM Tris-HCl (pH 8.3), 75 mM KCl, 3 mM MgCl2» 10 
mM DTT. 500 //M dNTP. 50 nc random hexamers per microgram of RNA, and 1 UZ/il 
RNase inhibitor. The reaction volume was 60 The reaction mixture was incubated at 
room temperature for 10 minutes, then at 37**C for 50 minutes. After reverse transcription 
the enzyme was denatured by heating to 65 °C for 10 minutes. After heat denaruration the 
samples were diluted with water to a fmal volume of 300 fiL 

Studies may be performed utilizing RT-PCR to examine mRNAs for differential 
expression, employing various oligonucleotides primers directed to nucleotide sequences of 
marker proteins of interest. These primers are used to direct the amplification of the cDNA 
fragments. 

B. Relative Quantitative RT-PCR With an Internal Standard 

The concentrations of the original total cell RNAs were determined by measurement 
of OV>260f28o (Sambrook e! ai, 1989) and confirmed by examination of ribosomal RNAs on 
ethidium bromide stained agarose gels. It is required thai all quantitative PCR reactions be 
normalized for equal amounts of amplifiable cDNA after the reverse transcription is 
completed. One solution to this is to terminate the reactions by driving the PCR reactions 
into plateau phase. This approach was utilized in some studies because it is quick and 
efficient. Lipoconin II was used as the internal standard or competitor. These PCRs were set 
up as: 

Reagents: 200 piM each dNTP, 200 nM each oligonucleotide primer, IX PCR buffer 
(Boehringer Mannheim including 1.5 mM MgCU), 3 /zl diluted cDNA, and 2.5 units of Taq 
DNA polymerase.' 100 fil of reaction volume. 
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Cycling parameters: 30 cycles of 94 *C for 1 min; 55**C for Imin; and 72 ''C for two min. 
Thermocyclers were either the MJ research ihermocycler or the Straiagene Robocycler, 

C. Relative Quantitative RT-PCR w^ith an External Standard 
5 There are three problems with the relative quantitative RT-PCR strategy described 

above. First, the internal standard must be roughly 4-10 times more abundant that the target 
for this strategy to normalize the samples. Second, because most of the PCR products are 
templated from the more abundant internal standard, the assay is less than optimally 
sensitive. Third, the internal standard must be truly unvarying. The result is that while the 
10 strategy described above is fast, convenient and applicable to samples of varying quality, it 
lacks sensitivity to modest changes in abundances. 

To address these issues, a normalization may be performed using both the fl-actin and 
asparagine synthetase mRNAs as external standards. These PCR reactions are performed 

15 with sufficient cycles to observe the products in the linear range of their amplification 
curves. Photographic negatives of gels of ethidium bromide stained PCR products are 
produced for each experiment. These negatives may be scanned and quantified using a 
BioRad densitometer. The quantified data may then normalized for variations in the staning 
concentrations of amplifiable cDNA by comparing the quantified data from each experiment 

20 with that derived from a similar experiment which amplified a cDNA fragment copied from 
the fl-actin mRNA. 

EXAMPLE XV. Results of Neural Networks Tools for Organ Confined Study 

These studies show the feasibility of using artificial neural networks (ANN) with 
25 noise factor for diagnosis of a cancer patient as organ confined (OC) or non-organ confined 
(NOC). The data set used by the inventors consisted of pathology and texmre variables for 
227 cases, and was used to generate the formula for the study. Ten cases did not have 
complete data, and were eliminated from the' data set before any training and/or test sets 
were created. Thus, 197 cases were used for training, with 20 cases saved for testing. 
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In order to find and extract all available information form the data set, ten different 
combinations of training sets and test sets were created. The training sets (197 data points) 
were randomly chosen from, the pool of 217 data points and the remaining 20 points were 
used for testing. This was done to compensate for the fact that the inventors had a small 
data set and to make sure the inventors have identified all useful information. Output \'alue 
of <0.5 indicates the patient is OC. and output value of >0.5 indicates the patient is NOC. 

The network configuration considered was a standard multilayer sigmoidal network 
with a single hidden layer. The learning rate used was 0.7 and the noise factor was 0.2. 
The neural network input layer consisted of 7 or 8 neurons to accommodate the input data set 
of 7 or 8 measurements. The activation function in each hidden layer and output layer 
neuron is sigmoid. Different network configurations (number of hidden layer neurons) with 
various training termination conditions were tested. For a given training data set, the neural 
network classifier tends to work best with 4 hidden layer neurons, and when the training is 
terminated at or below 10,000 iterations. Most cases, however, required around 1,000 
iterations. The output layer consists of two neurons, used to eliminate any chance of 
confusing the network during the training phase. 

A total of three different smdies were conducted, with two separate networks utilizing 
a standard backpropagaiion algorithm for training in each study. The results from these 
studies are set fonh in Tables XIX-XX. The first network was trained using the data from 
the inventors* database (UroCor, Inc., Tulsa. OK). The second network was trained after the 
training data was corrupted by adding noise, to force the network to identify useful 
information and discard the unwanted noise. 

In the first study, shown in Table XIX, the inventors used total of 7 inputs consisting 
of the seven variables used for model and the threshold PSA value. Each of the networks 
were tested using the ten different test sets described above, and the table shows the total 
number of correct classifications in each test set of 20. The first column presents results of 
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the tests where no noise was added. The second column presents results of the test cases 
where noise was added. Therefore, each table identifies the 20 lesi patterns used as well as 
the combined percent accuracy for both OC output and NOC output. 

5 For example, in Table XIX, comparing Test Set 2 and Test Set 10 (column 1) shows 

that it is possible to construct a network to classify a patient's status as OC or NOC very 
accurately, about 95% and 100% respectively. Additional manipulations of the data used for 
training these networks, within the scope of the skilled artisan, may result in determining the 
parameters required to make the network more accurate. It should be noted that repeatability 
10 at this point is directly a function of quality of data used for training. 

Further analysis of the individual responses in each test set thai is set forth in Table 
XIX points out the fact that many misclassified cases are very close to being correct. The 
inventors believe this problem may be resolved by adding noise to the data. To test this 

15 hypothesis the inventors created a new data set by adding random noise to the training set. 
As is evident from the results in Test Sets 1-10, the system performance was improved 
significantly. While some borderline cases which were initially classified correctly were 
misclassified because of the additional noise, modification of their noise input may be 
accomplished in such a way that such misciassification could be reduced as network learns 

20 more about the data during the training. 

For the second group of studies, set forth in Table XX, the inventors used the exact 
same variables used for the model, but the threshold PSA was replaced with its actual 
normalized value. Two networks were trained; one used noisy input while the other was 
25 trained with the input that had no added noise. Both networks had similar performance, 
about the same as the network with threshold PSA. 
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Table XIX 
Number Correct for 7 Inputs 
(no PrePSA) 





No Noise 


With Noise 


Test Set 1 


17 


17 


Test Set 2 


19 


19 


Test Set 3 


19 


18 


Test Set 4 


14 


15 


Test Set 5 


11 


12 


Test Set 6 


16 


14 


Test Set 7 


15 


16 


Test Set 8 


16 


15 


Test Set 9 


13 


14 


Test Set 10 


19 


20 


Overall Accuracy 


79.50% 


80.00% 
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Table XX 
Number Correct for 7 Inputs 
(no PrePSA-16) 





No Noise 


With Noise 


Test Set 1 


17 


17 


Test Set 2 


19 


19 


Test Set 3 


18 


18 


Test Set 4 


12 


12 


Test Set 5 


14 


14 


Test Set 6 


14 


14 


Test Set 7 


17 


17 


Test Set 8 


12 


13 


Test Set 9 


15 


15 


Test Set 10 


19 


19 


Overall Accuracy 


78.50% 


79.00% 



* * *r 

All of the methods disclosed and claimed herein can be made and executed with 
undue experimentation in light of the present disclosure. While the methods of this invention 
have been described in terms of preferred embodiments, it will be apparent to those of skill 
in the art that variations may be applied to the methods, and in the steps or in the sequence 
of steps of the method described herein without departing from the concept, spirit, and scope 
of the invention. More specifically, it will be apparent that certain agents thai are both 
chemically and physiologically related may be substituted for the agents described herein 
while the same or similar results would be achieved. All such similar substiniies and 
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modifications apparent to those skilled in the art are deemed to be within the spirit, scope, 
and concept of the invention as defined by the appended claims. 

The following is the Microsoft Excel v5.0 source code for the macro program used to 
convert the CMP v3.0 vector files as set forth in Example II. 
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* CMP Hacro 

* Macro recorded 6/16/94 by Craig Miller 
< Keyboard Shortcut: CtrZ-rc 

S\ib CNPO 

HagBox "To run this macro, you must first convert the listmode files us2.ng the CDI.5P IMPOR 
T.<OX program in CORTEX. If you have not done this, please select the file named CKD.XLS** 
ppenliiae : 

MagBox *Open the •.WKl Vector file which you wish to separate, or end the macro by seleccin 
9 Che EMD.WXl file" 
ChDrive "F" 

ChDir •'F:\USERS\MCM\4 0X" 

OpcnPile « Application. GctOpenFilename (-Lotus 1-2-3 C.wkl). •.wJcl-l 
Wor)dx>oKs . Open Filename : cOpenFile 
FileLength - Len (OpenFile) 
Searchstr « •X\- 

Period » InStr (OpenFile, Searchstr) 
OpenFilel - Mid (OpenFile , Period * 2) 
FileX«engthl « X#en (OpenFilel) 
SearchStri " . - 

Periodl ■ InStr (OpenFilel. SearchStri) 
(^penFile3 - Left (OpenFilel . Periodl - 1) 
CASENDM • Range (**A2'') .Value 

* If there are no values in the file« end the macro. If data is present, proceed 

If CASEHUM « Then GoTo I«astLine Else GoTo FormacLine 
formatLine : 

Application. S ere enUpda ting » False 
Cells . Select 
With Selection 

.HorizontalAlignment « xlCenter 

. VerticalAlignment » xlBottom 

.WrapText » False 

.Orientation - xlHorizontal 
End With 

With Selection. Font 

.Name « "Times New Roman* 
.FontStyle ■ "Regular" 
.Size m 10 

.Stri)ce through False 
.Superscript - False 
.Subscript w False 
.OutlineFont c False 
.Shadow False 
.Underline « xlNone 
.Colorlndex « xlAutomatic 
End With 

ColUBBis ( "B : C" ) . Select 
Selection. Delete Shif t :«xlToLef t 
Columns (*£:G") .Select 
Selection . Insert Shi f t : > xlToRight 
Columns ( "A: AE") .Select 
Selection. Cut 

Workbooks . Open Filename : « " C : \CRAIG \ MACROS \WK1CNVRT . ELK" 

ActiveSheet . Paste 

Range("B:D,H:AD-) .Select 

Selection. NumberFormat - "0.000000" 

Range (-E:G-) .Select 

Select ion. Number Format « "0.00" 

Columns ( "A: AE" ). Select 

Selection . Ent i reColumn . AutoFi t 

With Selection. Borders (xlLeft) 

.Weight - xlThin 

.Colorlndex « xlAutomatic 
End With 

With Selection. Borders (xlRight) 

.Weight « xlThin 

.Colorlndex « xlAuto.T*atic 
End With 



wo 97/12247 



PCT/US96/15780 



With Selection. Borders (xlTop) 

.Weight = xlThin 

.Colorlndex « xlAutomacic 
End with 

With Selection. Borders (xlBottom) 

.Weight « xlThin 

.Colorlndex > xlAutomatic 
End With 

Selection . Border Around Lines tyle : *xiNone 
Range {"A2'') .Select 
Application.ScreenUpdating o True 
Chbrive "F" 

ChDir -F:\USERS\MCM\40X\CMP- 
With ActiveWor)cbook 
.Title « 
.Subject B 

.Author « "Craig Miller" 
.Keywords • 
.Comments « 
End With 

ActiveWorkbook.SaveAs Filename : «OpenFile2 , FileFormat :«xlNormal. _ 
Password: , writeResPassword: ReadOnlyRecommended ; « _ 
False, CreateBackup: -False 
Application.ScreenUpdating * False 
CASENUM - Range ("A2") .Value 
While CASENUM <> 
Select all cells pertaining to this particular case 
Application.ScreenUpdating » False 

y - 3 

While cells (y, 1) - CASENUM 
y - y 4 1 

Wend 

Select the cells for one case, copy them to a blank CMP template, ^ 
and delete the first and last row (blank rows from template) . " 
Range (Cells (2. 1). Cells (y - 1. 31}). Select 
Selection .Copy 

Workbooks. Open Filename :.-C: \CRAIC\MACROS\ CASCMP.BLK* 

Selection. Insert Shift : mxlOown 

Rows ("3: 3") .Select 

Application .Cut CopyModc « False . 

Selection. Delete Shift :«xlUp 

y . 3 

While Cells (y, 1) - CASENUM 
y ■ y + 1 

Wend 

Range (Cells (y. l) , Cells (y, 31)). Select 

Selection. Delete Shift :«xlUp 
Make the first column the cell number 

Range ( " Al « > . Select 

ActiveCell.FormulaRlCl « "Cell" 

Range ( ** A3 ) . Select 

ActiveCell.FormulaRlCl - ♦'I- 

Range { " A4 » ) . Select 

ActiveCell.FortnulaRlCl » "2" 

Set Source « Range (Cells (3 . 1), Cells (4, 1}) 

Set FILL - Range (Cells (3. 1). Cells (y - 1, D) 

Source. AutoFill destination : e^FILL 
sort all of the cells according to first the Area and second the Shape 

Set CELLDATA - Range (Cells ( 3 . 1). Cells (y - 1, 31)) 

CELLDATA . Sort Xey 1 : «Range ( "C3 " I , Orderl : ^xlAscending , Key2 : eRange 
CD3"). Order2:cxlAscending. Header: -xlGuess , Ordercustora: el , 
MatchCase : « False , Orientation : exlTopToBottom 

Columns («E:E-) .Select 

Selection. Delete Shif t : ^xlToLef t 
Select the position where the DNA and perimeter information will be 
pasted, and open the DNA file created in CORTEX having the same name 
as the • . ILM file. 

Range ( **E3 " ) . Select 

CASENUM e Trim (CASENUM) 

CtiDrive "F" 
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ChDir "F:\USERS\MCM\PrS- 

workbpoks .OpenText Filename : «CASEmJM, Origin: « _ 

xlWindows. ScartRow:-i2, DataType : exlDelimiced. TextOualifier 
: cxlDoubleC'JOte , ConsecutiveDelimiter t » False , Tab: -False. _ 
Semicolon: -False < Comma :» True , Space : «False . Other : oFalse . _ 
Fieldlnfo: -Array (Array (1, 1), Array 12, 1), Array (3, IJ . Arriy;4, i)) 
Detemine the number of cells present and calculate the perimeter for _ 
each cell 
I « 1 

While Cells tl. 1) > 0 
I « I ♦ 1 

Wend 

Set Source ^ cells (1« 5) 

Set FILL = Range (Cells (1. 5) , Cells (2 • 1. 5)) 

Range ( -Ei - ) . Select ' 

AcniveCell . FormulaRlCl "»S0RT(RC (-4) •RC (-31 ) 

Source. AutoFill destination : cFILL 

Range (Cells (1, S) , Cells (I - l, 5)). Select 

Selection . Copy 

Selection. PasteSpecial Paste : «xlValues , Operation : «xlNone, _ 

SkipBlanks: .False, Transpose : -False 
Application. Cut CopyMode • False 
Sort the cells according to first the Area and second the Shape 
Rows C'lzl") .Select 

Selection . Sort Keyl : -Range ( "Al • ) , Ordcrl : «xlAsccnding . Key2 : -Range _ 
("Bl«}« Order 2 : sxlAscending , Header :«xlGuess , OrderCustom: »l, _ 
MatchCase: -False, Orientation :-xlTopToBot torn "* 
Copy aad paste the DKA and Perimeter data to the full database and _ 
format the selection 

Range (Cells (1, 4). Cells (I - 1, 5)). Select 
Selection . Copy 

Windows ( ''CASCMP.BUC-) .Activate 
ActiveSheet . Paste 
Application. CutCopyMode - False 
With Selection. Font 

.Name - "Times New Roman" 

.FontStyle • "Regular" 

.Size - 10 

.Strike through « False. 

• Superscript «» False 

.Subscript « False 

.OutlineFont « False 

.Shadow « False 

.Underline • xlNone 

.Color Index - xlAutomatic 
2nd With 
With Selection 

.Horizontal Alignment « xlCenter 

. VerticalAlignment - xlBottom 

.WrapText » False 

.Orientation « xlHorizontal 
End With 

With Selection. Borders (xlLeft) 

.Weight • xlThin 

.Colorlndex - xlAutomatic 
£nd With 

'With Selection. Borders (xlRight) 
■Weight - xlThin 
.Colorlndex - xlAutomatic 

End With 

vith Selection. Borders (xlTop) 

.Weight « xlThin 

.Colorlndex - xlAutomatic 
End With 

With Select ion. Borders (xlBottom) 

.Weight - xlThin 

.Colorlndex « xlAutomatic 
End With 

Selection . BorderAround LineStyle : ^xlNone 
Enter the column headings zor the DNA and Perimeter columns 
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Range (••£!••) .SeXecL 
ActiveCell.ForrnulaRlCl « "Pg" 
Range ( '*C2*') .Selecc 
ActiveCell.FormulaRlCX » **DNA'' 
Range ("Fl") .Select 
ActiveCell.FormulaRlCX = "CAS" 
Range ("F2-) .Select 

ActiveCell.FonnulaRlCX - "Perimeter" 
Application. Goto Reference :« "Stats" 
Selection . Copy 

Selection. PasteSpecial Paste : «xlValues , Op>cration: oxlNone. _ 

SkipBlanks : -False , Transpose : -False 
Application.CutCopyMode « False 
Columns ("FtF") .Select 
Selection. Cut 
Columns < *' A£ : A£" ) . Select 
Selection. Insert Shift : =xlToRight 
Select ion. Number Format e "O. 000000" 
Range (Cells (3, 5). Cells (y - 1. 5)). Select 
Select ion. Number Format « "0.00" 
Shects(*CMP Blank •) .Select 
Sheets ("CMP Blank"*) .Name = CASENUM 
Sort the cell data according to the Cell Number and save the workbook _ 
with the same filename as the ♦ . XLM file 

Set CEIXDATA - Range (Cells (3, 1), Cells (y - 1. 30)) 

CELLDATA. Sort Key X : -Range ("A3") , Order! : •xlAscending. Header : =xXGuess . _ 

OrderCustomteX, MatchCase :-FaXsc, Orientation :«xlTopToBottom 
Cells (1, 1) .Select 
. With ActiveWorkbook 

.Title « 

.Subject » -" 

.Author « "Craig Killer" 

.Keywords ■ "" 

. Comments ■ " " 
End With 

ChDir "F:\USERS\MCM\40X\CMP- 

ActxveWorkbook.SaveAs Filename : -CASENUM. FileFormac : -xlNormal _ 
, Password:-"", Wri teRes Password . RcadOnl yRe commended : - _ 
False, CreateBackup: -False 
Select the Statistical Calculations, paste them to a blank spreadsheet, _ 
and cut and copy ail of the values to a single row. 
Range (Cells (y + 2, 1), Cells (y ♦ 6, 30)). Select 
Selection . Cut 

workbooks .Add Template :» "Workbook" 
ActiveSheet . Paste 
Range ("Al") .Formula « CASENUM 
Range ("B2 :AD2*') .Select 
Selection . Cut 
Range ("AJEl") .Select 
ActiveSheet . Paste 
Range ("B3 :AD3*) .Select 
Selection. Cut 
Range <**BH1") .Select 
ActiveSheet . Paste 
Range ("B4 :AD4") .Select 
Selection. Cut 
Range ("CKl") .Select 
ActiveSheet . Paste 
Range ("B5 :AD5") .Select 
Selection. Cut 
Range ( "DNl " ) . Select 
ActiveSheet . Paste 
Rows ("1:1") .Select 
Selection . Copy 
Paste the statistics for the case into the summary file 

Workbooks .Open Filename : r - F : \USERS\MCM\4 0X\SUM-OD. CMP" 
Selection. Insert Shift : -xlDown 
Application.CutCopyMode - False 
Active Workbook . Save 
Act iveWindow. Close 
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ActiveWorkbook . Saved « True 
ActiveWindow . Close 
Activewprkbook . Saved « True 
ActiveWindow . Close 
Accive Workbook. Saved = True 
ActiveWindow. Close 
Selection. Delete Shift rexlUp 
Range {"A2") .Select 
Application. ScreenUpdating « True 
•continue looping the macro until all of the cases contained in this vector 
file have been separated and no data is left in the vector file. 
CASENUM - Range (-A2") .Value 
Wend 

'If there is more than one vector file the separate, open the next file 

ActiveWorkbook. Saved = True 

ActiveWindow . Close 

ActiveWorkbook. Saved » True 

ActiveWindow . Close 

GoTo OpenLine 
> End the macro and refresh the screen 
X^astLine : 

ActiveWorkbook. Saved c True 

ActiveWindow . Close 

Application. ScreenUpdating ^ True 
End Sub 
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The following is a fJowchart and the source code listing of a computer program for 
the DNA content information import as set forth in Example II. 
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n^CTlON IMPORT. ILM 
IMPORT. ILM 



NOTICE: THIS SOFTWARE IS THE VALUABLE CONFIDENTIAL TRADE SECRET 

PROPERTY Or ADEPT TECHNOLOGIES. INC. 

(ADEPT TECH.) OF HOUSTON, TEXAS. IT MAY NOT BE USED. 
DISCLOSED, OR REPRODUCED BY ANY MEANS WHATSOEVER WITHOUT 
THE PRIOR WRITTEN AUTHORIZATION OF ADEPT TECHNOLOGIES, 



PROGRAM 

XETDRNS 

DESCRIPTION 

TYPE 

DATS 

AOTHOR 

MODS 

PLANS 



IMPORT. ILM 

INFORMATION - TO DNAILM, CALLING ROUTINE. 

IMPORT CAS DNA * . ILM LISIWODE DATA 

PROGRAM 

10 MARCH 93 

Michael P. Bacus 

05 MARCH 93 - VERSION 1.00 - FIRST USEABLE VERSION 

IS MARCH 93 • VERSION 1.01 - BOG FIX, OFF BY ONE ON PG HISTOGRAM 

17 JUNE 93 - VERSION 1.02 - MEAN NUCLEAR ROUNDNESS 



DECLARE SUBROUTINE DIRLI ST, MSG. PRINT, ILM 
DECLARE FUNCTION SEQ 

OPEN "CDI.CAS.ILM" TO CDI. ILM. FILE ELSE STOP 

OPEN -CDI. CAS" TO CDI. CAS. FILE ELSE STOP 

OPEN "CASE- TO C.FILE ELSE STOP 

COUNT - 0 

PI - 3.1415926535 



NEW. PATH « "C:\TEMP\" 
INITDIR NEW,PATH:"*.ILM* 
RD.NAME « 'RD' 
RD.CNT m 0 
ANSWER ■ ' ' 
AMSWERcl> • 0 
ANSWER<2> m 0 
LOOP 

LIST - DIRLISTO 
UNTIL LIST « ' ' 

COL m 0 

LOOP 

REMOVE FILE. NAME FROM LIST AT COL SETTING FLAG 
ONTIL FLAG - 0 DO 

OLD. FILE. NAME « FILE. NAME 

OLD. FILE. NAME [-3, 3) « 

OLD . FILE . NAME « OLD. FILE .-NAME :' CYT' 

EXTENSI9N « FILE. NAME (-3, 3) 

ILM. ID « ' • 

IF EXTENSION "ILM" THEN 
CHECK FOR SIMILAR •.ILM UPLOAD, QUICKLY 
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OLD. FILE. R£C « " 

OSOPEN NTW. PATH: OLD •'PILE.NAME TO OLD. ILM. FILE 'HiEN 

OSBREAD OLD. FILE. JC FROM OLD. ILM, FILE AT 0 ;NGTK 61000 
OSCLOSE OLD. ILM. FILE 

END 

COUNT = COUNT + 1 

OSOPEN NEW. PATH: FILE. NAME TO ILM. FILE THEN 

OSBREAD FILS.REC FROM ILM. FILE AT 0 LENGTH 61000 
IF OLD. FILE. REC = FILE.REC THEN 

♦ ACCESSION NUMBER, RECORD KEY 

ILM. REC « ' ' 

ILM.NUM = 13; REC.POS = 301; REC.LEN » 33 

GOSUB GET. STRING 

ILM. ID = TRIM ( ILM. R£Ccl3>) 

ILM. ID ^ ILM. ID [1,10] 

READ DUMMY FROM CDI . ILM. FILE, ILM. ID THEN 

PCPERFORM "DEL NEW. PATH: FILE. NAME 
END ELSE 

GOTO MONKEY: 
END 
END ELSE 

DNKSY: 

ILM. REC « " 

♦ ILM. VERS ION 

ILM.NUM « 1; REC.POS « i; REC.LEN = 2 
GOSUB GET. STRING 

♦ SIGNATURE 

ILM.NUM = 2; REC.POS « 3; REC.LEN « 10 
GOSUB GET. STRING 

♦ COMPANY 

ILM.NUM «.3; REC.POS « 13; REC.LEN « 64 
GOSD3 GET. STRING 

♦ INSTRUMENT 

ILM.NUM « 4; REC.POS a 77; 'REC.LEN = 64 
GOSUB GET. STRING 

♦ STAIN 

ILM.NUM 1^ S; REC.POS « 141; REC.LEN « 64 
GOSUB GET. STRING 

♦ FILTER WAVELENGTH 

ILM.NUM » 6; REC.POS « 205; REC.LEN « 2 
GOSUB GET. INTEGER 

♦ FILTER BANDWIDTH 

ILM.NUM « 7; REC.POS « 207; REC.LEN « 2 
GOSUB GET.INTEGEIR 
*■ MAGNIFICATION 

ILM.NUM • 8; REC.POS » 209; REC.LEN - 2 
GOSUB GET . INTEGER 

♦ TIME OF CREATION 

ILM.NUM « B; REC.POS « 211; REC.LEN = 16 
GOSUB GET. STRING 

♦ TIME OF ANALYSIS 

ILM.NUM « 10; REC.POS » 227; REC.LEN « 8 
GOSUB GET. STRING 
- OPERATOR ID 

ILM.NUM «= 11; REC.POS «= 23S; REC.LEN » 33 
GOSUB GET. STRING 

♦ PATIENT NAME 

ILM.NUM « 12; REC.POS = 268; REC.LEN « 33 
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GOSUB GET. Sl.-iiivvj 
ACC£SSION NUMBER, RECp^D KEY 

ILM.NUM = 13; R :.?0S = 301; RECLEN « 33 

GOSUB GET. STRING 

ILM.ID = TRIM(Il4M.REC<13>) 

ILM.ID « ILM.ID(1,10J 
SEQUENCE PATH 

ILM.NOM « 13; REC.POS = 334; RSC.LEN = 201 

GOSUB GET. STRING 
DNA INDEX 

ILM.NUM « 14; REC.POS = 535; REC.LEN « 8 

GOSUB GET. DOUBLE 
FIELD COUNT 

ILM.NUM = 15; RJEC.POS = 543; REC.LEN .-.2 

GOSUB GET. INTEGER 
CELL COUNT 

ILM.NUM « 16; REC.POS « 545; REC.LEN « 2 

GOSUB GET. INTEGER 
PEAK VALUE 

ILM.NUM = 17; REC.POS « 547; REC.LEN « 2 

GOSUB GET. INTEGER 
CLASS NAME, SI 

ILM.NUM « 18; REC.POS = 549; REC.LEN « 13 

GOSUB GET. STRING 
CLASS NAME, S2 

ILM.NUM «= 19; REC.POS « 562; REC.LEN = 13 

GOSUB GET. STRING 
CLASS NAME, #3 

ILM.NUM « 20; REC.POS «= 575; REC.LEN = 13 

GOSUB GET. STRING 
CLASS NAME, i^4 

ILM.NUM - 21; REC.POS - 588; REC.LEN « 13 

GOSUB GET. STRING 
CLASS NAME, 

ILM.NUM « 22; REC.POS - 601; REC.LEN = 13 

GOSUB GET ..STRING 
CLASS NAME, #6 

ILM.NUM « 23; REC.POS = 614; RECLEN = 13 

GOSUB GET. STRING 
CALIBRATION REJECTED CELL ARRAY 

ILM.NUM « 24; REC.POS « 627; REC.LEN > 128 
GOSUB GET. CHAR. ARRAY 
O^ilBRATION NUCLEAR MASS ARRAY 

ILM.NUM « 25; REC.POS « 627; REC.LEN « 128 

ILM-NUM « 25; REC.POS = 627; REC.LEN « 128 

GOSUB GET. FLO AT. ARRAY 
CALIBRATION NUCLEAR AREA 

ILM.NUM « 26; REC.POS « 1267; REC.LEN « 128 

ILM.NUM « 26; REC.POS « 1139; RSC.LEN = 128 

GOSUB GET. LONG. ARRAY 
ANALYSIS FIELD COUNT 

ILM.NUM « 51; REC.POS « 1651.; REC.LEN = 2 

GOSUB GET. INTEGER 
AIn^ALYSIS CELL COUNT 

* ILM.NUM = 27; REC.POS = 1779; REC.LEN » 2 

]^.NUM ^ 21; REC.POS « 1653; REC.LEN « 2 
GOSUB GET. INTEGER 

ANALYSIS CELL TYPE 

ILM.NUM « 28; REC.POS = 1781; REC.LEN = 512 
ILM.NUM = 28; REC.POS «= 1655; REC.LEN = 512 



-114- 



wo 97/12247 



PCT/US96n5780 



AJmAI-YSIS UUChZPJl MASS 

ILM.NUK = 29;-^EC,POS « 2253; REC.LEN = 512 

ILM.NUM = 29; JC.POS = 21S7; REC.LSN « 5 

G0SU3 GET. FU)AT. ARRAY 
ANALYSIS NUCLSAR AREA 

ILM.NUKi « 30; PJEC.POS = 4341; REC . LHN = 512 

ILM.NUM = 30; REC.POS « 421S; REC.LEN = 512 

GOSUB GS?. LONG. ARRAY 
1280 BIN PICOGRAM HISTOGRAM 

ILM.NDM e 31; REC.POS « 6389; REC.LEN = 1280 

ILM.NUM « 3X; REC.POS « 5263; REC.LEN = 1280 

GOSOB GET. INT. ARRAY 

RD.CNT += 1 
FIRST PEAK , • ^^^^ 

ILM.NUM = 52; REC.POS » 8823; REC.LEN = 2 

GOSUB GET. INTEGER 
INTEGER, SECOl^D PEAK " ^ 

ILM.NUM « 53; REC.POS = 8825; REC.LEN « 2 

GOSUB GET. INTEGER 
FIELD AREA A, FLAG = 1 ON FLAG = ON 

ILM.NUM « 33; REC.POS ^ 8827; REC.LEN * 2. 

GOSUB GET. INTEGER • 
MINIMUM BOUNDARY FIELD A 

ILM.NUM = 34; REC.POS = 8829? REC.LEN « 2 

GOSUB GET. INTEGER 
MAXIMUM BOUNDARY FIELD A 

ILM.NUM « 35; REC.POS - 8831; REC.LEN = 2 

GOSUB GET. INTEGER 
FIELD B 

ILM.NUM^ 36; REC.POS « 8833; REC.LEN » 2 

GOSUB GET. INTEGER 
MINIMUM BOXJNDARY FIELD 3 

ILM.NUM « 37; REC.POS « 8835; REC.LEN «= 2 

GOSUB GET. INTEGER 
MAXIMUM BOUNDARY FIELD B 

ILM.NUM « 38; REC.POS « 8837; REC.LEN » 2 

GOSUB GET. INTEGER 
FIELD C 

ILM.NUM = 39; REC.POS 8839; REC.LEN » 2 

GOSUB GET. INTEGER *. ■ 

MINIMUM BOUND.IVRY FIELD C 

ILM.NUM = 40; REC.POS •= 8841; REC.LEN = 2 

GOSUB GET. INTEGER 
MAXIMUM BOUNDARY FIELD C 

ILM.NUM « 41; REC.POS « 8843; REC.LEN « 2 

GOSUB GST. INTEGER 
FIELD D 

ILM.NUM = 42; REC.POS " 8845; REC.LEN » 2 

GOSUB GET. INTEGER 
MINIMUM BOUNDARY FIELD D 

ILM.NUM = 43; REC.POS «= 8847; REC.LEN = 2 

GOSUB GET. INTEGER 
MAXIMUM BOUNDARY FIELD D 

ILM.NUl^ = 44; REC.POS = 8849; REC-LEN = 2 

GOSUB GET. INTEGER 
FIELtf E 

ILM.NUM = 45; REC.POS « 8851; REC.LEN « 2 
GOSUB GET. INTEGER 
MINIMUM BOUNDARY FIELD E 
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GOSUB GET. INTEGER 

* MAXIMUM BOUNDARY F; D Z 

ILM.NUM « 47; Rr-C . POS = B855; REC.LEN « 2 
GOSUB GET. INTEGER 

* FIELD F 

ILM.NUM = 4B; REC.POS » 8857; REC.LEN « 2 
GOSUB GET. INTEGER 

* MINIMUM BOUNDARY FIELD F 

ILM^NUM = 49; REC.POS = 8859; REC.LEN « 2 
GOSUB GET. INTEGER 

* MAXIMUM BOUNDARY FIELD F 

ILM.NUM = 50; REC.POS = 8861; REC.LEN « 2 
GOSUB GET. INTEGER 
CELL RE:VIEW, DELETED 

ILM.NUM « 32; REC.POS = 8863; REC.LEN « 2 
GOSUB GET. INTEGER 

RZAD CELL DATA 

ADJUST - 0 
BUF.CNT " 1 
CELL. OFFSET - 0 
TEMP-ILM.REC = ILM.REC 

IF 1LM.REC<27> > 512 THEN ILM.REC<:27> » 512 
FOR I. CELL = 1 TO ILM.REC<27> 

IF (88 63+CELL.OFFSET+36) > (6 0000 ♦BUF.CNT) THEN 
FILE.POS « BUF.CNT* 60000 

OSBREAD FILE.REC FROM ILM.FILE AT FILE.POS-100 LENGTH 61000 
IF BUF.CNT • 1 THEN 

ADJUST « ADJUST + 60000-100 
END ELSE 

ADJUST m ADJUST * 60000 
END 

BUF.CNT +« 1 
END 

CELL XMIN 

ILM.REC<90> - 0 

ILM.NUM » 90; REC.POS « 8 8 63 +CELL. OFFSET -AD JUST; REC.LEN « 2 
GOSUB GET«INT£GSR 

TEMP.ILM.R£C<90,I.CELL> » ILM.REC<90> 

CELL XKAX 

ILM.REC<91> - 0 

ILM.NUM - 91; REC.POS « 8865+CELL.OFFSET-ADJUST; RECLEN - 2 
GOSUB GET. INTEGER 

TEMP.ILM.REC<91, I.CELL> » ILM.REC<:91> 

CELL YMIN 

ILM.REC<92> - 0 

ILM.NUM « 92; REC.POS « 8 8 67 +CELL. OFFSET -AD JUST; REC.LEN » 2 
GOSUB GST. INTEGER 

TEMP.ILM.REC<92,I.CELL> « ILM.REC<92> 

CELL YMAX 

ILM.REC<93> « 0 

ILM.NUM « 93; REC.POS « 88 6 9 -♦•CELL. OFFSET -ADJUST; REC.LEN » 2 
GOSUB GET. INTEGER 

.TEMP.ILM.REC<93 , I.CELL> = ILM.REC<93> 
CELL CLASS 

ILM.R£C<94> » 0 

ILM.NUM = 94; REC.POS « 8 87 1+CELL . OFFSET- ADJUST; REC.LEN « 2 
GOSUB GET.nrTEGER 
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» CELL OFFSET 

ILM.H£Cc95> ' "^O 

ILM.KUI-l = 55, RECPOS = 8873-rCELL . OFFSI .J^J^JUST; REC.L-N « ' 
G0SU3 GET.i^ONG ' 
TEMP.ILM.REC<95, I.CELL> = rLM;RSC<9S> 

* CELL PS£P, 

ILM.R£C<S6> = 0 

ILM.NuM = 96; REC.POS « 8 8 77 +CELL .OFFSET -ADJUST; REC.LEN « * 
G0SU3 GET. LONG 

TEMP.ILM.RSC<96, I.CELL> = ILM.RECc96> 

* CELL BOF SIZE 

ILM.REC<97> « 0 

ILM.KDM « 97; REC.POS « 888 1+CELL .OFFSET -AD JUST; REC.LEN := 2 
GOSra GET. INTEGER 

TEMP.ILM.REC<97, I.CELL> = ILM.REC<97> 

* CELL MASS 

ILM.REC<98> 0 

ILM.NUM = 98; REC.POS = 8 8 83 +CELL. OFFSET -ADJUST; REC.LEN = 1 
GOSUB GET. FLOAT 
IF LEN(ILM..REC<98>) > 10 THEN 
TEMP.ILM.REC<98,I,CELL> « 0 
END ELSE 

TEM?.ILM.R£C<98,I.CELL> « ILM.REC<98> 
END 

* CELL SHAPE 

ILM.REC<99> .= 0 

ILM.NUM « 99; REC.POS « 8 8 87+CELL . OFFSET- ADJUST; REC.LEN = 1 
GOSUB GET. FLOAT 

TEMP.ILM.REC<99, I.CELL> « ILM.REC<99> 

* CELL NAME 

ILM.REC<100> =0 

ILM.NUM = 100; REC.POS = 88 9 1 +CELL . OFFSET- ADJUST ; REC.LEN « 6 
GOSUB GET. STRING 

TEMP.ILM.RECclOO, I.CELL> ^ ILM.REC<100> 
Calcul&ce Mean Nuclear Roundness 

*> 

PERIMETER « SQRT (TEMP . ILM.RECcSfi , I . CELL> *TEMP . ILM . RECc9 9 , I . CELL> ) 
TEMP.ILM.R£C<101, I.CELL> « ICONV( (PERIMETER/ (2 /SORT (TEMP. IL? 
TEMP.ILM.RECclOl, I.CELL> - ICONV(SQRT (TEMP . ILM.REC<99 , I . CELL>/ (4 

CELL. OFFSET = CELL. OFFSET + rLM.REC<:97> + 36 
NEXT I. CELL 
ILM.REC TSMP.ILM.RSC 

* FIGURE OUT STATISTICS 
* 

FOR I « 1 TO 6 
BEGIN CASE 
CASE I » 1 

ON. FLAG « ILM.RSCc33> 

MIN « ILM.RSC<34> 

MOC « ILM.REC<35> 
CASE 1=2 

ON. FLAG = ILM-REC<36> 

MIN = ILM.REC<3 7> 

KJO: « ILM.REC<3 8> 
CASE I « 3 

ON. FLAG = ILM.RECtf3 9> 
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MAX = ILM.-ISC<41> 

ON. FLAG « .LM.RECc4 2> 

MIN «= ILk.REC<43> . 

MAX = ILM.REC<44> 
CASE 1=5 

ON. FLAG = ILM.REC<.45> 

MIN = ILM.REC<46> 

MAX = Il4M.REC<47> 
CASE I « 6 

ON. FLAG « rLM.R£Cc4B> 

MIN = ILM.REC<49> 

MAX « IIiM-REC<SO> 
END CASE 

MIN = INT(MIN*lIiM.REC<14>*10+.5)/l00 
MAX = INT(MA>:*ILM.RECcl4>*10+.5)/l00 
MODAL. DI = 0 / 
MEAN.DI « 0 
VARIENCE = 0 
STANDAJID. DEVIATION « 0 
COEFFICIENT. VARIATION « 0 
PERCENT . CELLS » 0 
CELL, COUNT =0 
IF ON, FLAG = 1 THEN 
SUM. SQUARED « 0 
SUM. RAW « 0 
MODAL « 0 
FOR J 1= 1 TO 12B0 

IF ILM.REC<31, J> 0 THEN 

IF ({J-1 >« MIN) AND (J-1 MAX)) THEN 
IF ILM.RECc31, J> > MODAL THEN 
MODAL « ILM.REC<31, J> 
MODAL. DI = (J-1)/10/ILM.R£C<14> 
. END 

FOR I< « 1 TO ILM*.REC<31, J> 
DI » {J-1)/10/ILM.REC<14> 
SUM. SQUARED DI**2 
SUM. RAW DI 
CELL. COUNT 1 
NEXT K 
END 
END 
NEXT J 

IF CELL. COUNT ^ 0 THEN 

MEAN.DI « (SUM. RAW/CELL. COUNT) 
END ELSE 

MEAN.DI s 0 
END 

IF CEXJj. COUNT > 1 TEEN 

VARIENCE = ( SUM . SQUARED- ( ( SUM . RAW* SUM . RAW ) /CELL . COUNT ) ) / ( CELL . 
IF VARIENCE « 0 THEN 

STANDARD. DEVIATION «■ 0 
END ELSE 

STANDARD . DEVIATION « SQRT (VARIENCE) 
END 

COEFFICIENT. VARIATION » (STANDARD . DEVI AT ION/ MEAJN . DI ) '100 
END ELSE 

VARIENCE « 0 

STANDARD . DEVIATION « 0 
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END 

IF ILM.REC<2— i# 0 TH£N PERCENT . CELLS - C^LL . COUNT/ILK. R£C< 27 > 
END 

IU1.REC<48 + I*6> » ICONV (MODAL. DI, 'MJD2, ' ) 

ILM.REC<49 + I*6> = ICONV (MEAN . DI , ' MD2 , ' ) 

ILM.REC<S0 + I*6> = ICONV (STANDARD. DEVIATION, 'KD2, ' ) 

ILM.REC<51 + I*6> ICONV ( COEFFICIENT. VARI ATION, 'MD2, ' ) 

ILM.R£Cc52h-I*6> = I CONV (PERCENT. CELLS MD4 ) 

ILM.R£Cc53 + I*6> « CELL. COUNT 
NEXT I 

Filter Imported Data 

ILM.ID « '93-99999H' 
EXIST. FLAG « 0 

READ DtJMMY FROM GDI . ILM. FILE, ILM.ID THEN 

IF ILM.REC « DUMMY THEN EXIST. FLAG = 1 
END 

IF EXIST. FLAG « 0 THEN 

CONVERT DATA INTO OLD FORMAT FOR REPORT PRINTING 
> 

OFF. SCALE » 0 
Y. SCALE » 0 
CDI.CAS-REC « ' ' 
FOR I » 1 TO 1280 
IF 1 > 0 THEN 
BEGIN CASE 
CaiSE I c IS 

CDI.CAS.REC<1,X> « CDI.CAS.REC<1,1> 4- ILM.RZC<31, I> 
CASE I < 25 

CDI.CAS.REC<1,2> - CDI . CAS . REC<a , 2> + ILM.RECc31. I> 
CASE I < 35 

CDI.CAS.Ri:C<l,3> ■ CDI.CAS.REC<:1,3> ^ ILM . RECc3 1 , 1 > 
CASE I < 45 

CDI.CAS.REC<1,4> « CDI . CAS . RECcX , 4 > + ILM, REC<3 1 , I > 
CaSE I < 55 

CDI.CAS.RECcl.S> » CDI .CAS . REC<1, 5> ILM • REC<3 1 , I > 
CASE I < 65 

CDI.CAS.REC<1,6> « CDI . CAS . R£C<1 , 6> * ILM . REC<3 1 , 1 > 
CASE I < 7S 

CDI.CAS.REC<1,7> = CDI . CAS . RECcl, 7> + ILM.REC<31, 1> 
CASE I < 85 

CDI-CAS .RECd, B> » CDI .CAS -RSCcl, 8> + ILM,RSC<31 , I> 
CASE I < 95 

CDI.CAS.REC<1,9> « CDI .CAS .RECcl , 9> 4- ILM.REC<31, I> 
CASE. I < 105 

CDI.CAS.REC<1,10> e CDI.CAS.REC<1,10> + ILM . REC<3 L , I > 
CASE I < 115 

C2:>I.CAS.REC<1,11> « CDI.CAS.REC<1,11> -r ILM. REC<31 , I> 
CASE I < 125 

CD1.CAS.REC<1,12> « CDI . CAS . RECcl, 12> + ILM.REC<31, J.> 
CASE I < 135 

CDI.CAS.REC«:1,13> « CDI . CAS . RECcl . 13 > + IIJ<.REC<31 , i> 
CASE I < 145 

CDI.CAS.REC<1,14> = CDI.CAS.RECcl, 14> -k ILM . RECc3 1 , i > 
CASE I < 155 

CDI .CAS.RECcl, 15> « CDI . CAS . R£C<1 , 15> + ILM . R£Cc3 1 , I > 
CASE I < 165 
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CASE I < X7b 

CDI.'^^S.REC<1,17:> = CDI . . ?IEC<1 , 17 > ^ ILM.REC<3l t> 
CPS£ I . 185 

CDI.CAS.RECcl, 18> « CDI . CAS . R£C<^ , 18 > . ILM.REC<31 t> 
CASE I < 195 

CDI.CAS.RECcl,I9> « GDI . CAS .R£C<1 , 19 > ^ ILM.RECc31 r> 
CASE : < 205 

CDI.CAS.REC<1,20> = GDI . CAS.REC<1 , 20> - ILM . RECcSl , 1> 
CASE I < 215 

CDI.CAS.REC<1,21> = GDI . CAS . RECcl , 21> -r ILM . REC<31 , I> 
CASE I < 225 

CDI.CAS.REC<1,22> = GDI . CAS .RECcl. 22> ^ ILM .RJSC<31 , 1> 
CASE I < 235 

CDI.CAS.REC<1,23> = GDI . CAS.RECcl , 23> + ILM.REC<3.1, 1> 
CASE I < 245 

CDI.CAS-R£C<1,24> « GDI . CAS .REC<1 , 24 > + ILM.REC<31, I> 
CASE I < 255 

CDI.CAS,REC<1,25> « GDI . CAS .RECcl, 25> REC<31 , I> 

CASE I < 255 

CDI.CAS.REC<1,26> = GDI . CAS .RECcl , 26> ^ ILM. REC<31 , I> 
CASE I < 275 

CDI.CAS.REC<1,27> « GDI . CAS .REC<1 , 27 > - ILM . REC<3X , I> 
CASE : < 285 

CDI.CAS.REG<1,28> = GDI • CAS . REC<1 , 28 > -r ILM. REC<31 , I> 
CASE I < 295 

CDI.CAS.REC<1,29> = GDI . CAS.REC<1, 29> + ILM. RECc31 , I> 
CASE I < 305 

CDI.CAS.RECcl,3 0> = GDI . CAS .REC<1 , 30> ^ ILM. RECc31 , I> 
CASE I < 315 

CDI.CAS.REC<1.31> « GDI . CAS.REC<1 , 31> ^ ILM . REC<31 , I> 
CASE I < 325 

CDI.CAS.REC<1,32> - GDI . CAS .RECcl , 32> -r ILM . REC<31 , I> 
CASE 1 

OFF. SCALE « OFF. SCALE + ILM.REC<31, I> 
END CASE 
END 
NEXT I 

CDI.CAS.REG<2> = ILM.REC<12> 
IF ILM.REGc52> ? 65535 THEN 

IF ILM:REC<14>+.5 < 1 THEN 
CDI.CAS.REC<4> * 0 - 

END ELSE 

CDI.CAS.RECc4> « (INT (IXiM.REC<52>/ILM.REC<14>+ . S) J /lOO 

END 
END ELSE 

CDI.CAS.REC<4> « 0 
END 

CDI.CAS.REC<12> « OFF. SCALE 
Y. SCALE » 0 
Y.POS B 0 

FOR I « 1 TO 32 

IF CDI.CAS.REC<1. I> > Y. SCALE THEN 
y. SCALE « CDI.CAS.REC<1, I> 
y,POS » I 
END 
NEXT I 

Y.DIV = (Y.SCALE*!.!) /5 

TRAKS = 0 

IF y.POS > 0 THEN 
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END 

FOR I = 1 TO ^ 

TEM? ^ CDi.wiS.REC<l, I>*TRANS 
IF TEI^P > 0 AND TEMP < 1 THEN 

CDI.CAS.RSCcl, I> = 1 
END ELSE 

CDI .CAS.R£C<1, I> « INT(TSMP-i-.5) 
END 
NEXT I 
FOR I « 1 TO 32 

TEMP « INT(CDI.CAS.RECcl,I>*TRANS+.5) 
IF TEMP > 0 AND TSMP < 1 THEN 

CDI.CAS.REC<1,I> « 1 
END ELSE 

CDI.CAS.REC<1,I> = TEMP 
END 
NEXT I 

FOR I e 1 TO 5 

CDl.aLS.RECc23-I> = INT(Y.DIV*I) 
NEXT I 

CDI.CAS-REC<23> « 0 
GOSUB KNR. EXPORT 

WRITE CDI.CAS.REC TO CDI . CAS . FILE, ILM.ID 
WRITE ILM.REC TO CDI . ILM. FILE, ILM.ID 

PRINT. ILM (ILM.ID) 

PREVIOUS PATIENT HISTORY 

PATIENT. NUMBER « XLATE ( ' CASEMLM. ID, 8 , ' X' ) 
PCPERFORM "CAPTURE J«J0B12 TI«0 > NUL;" 
IF PATIENT . NUMBER ^ THEN 

OPEN 'LISTS' TO LISTS /FILE ELSE STOP 

LISTS. ID = ' CR' :@STATION 

SECOND- ID = ' SR' : ©STATION 

FIRST. LIST = ' ' 

FIRST. LIST<1> ^ ILM.ID 

FIRST. LIST<2> « 0 > 
SECOND. LIST « ' ' 
SECOND -LIST<1> « ILM.ID 
SECOND. LIST<2> = 0 
WRITS FIRST. LIST TO LISTS. FILE. LISTS. ID 
WRITE SECOND. LIST TO LISTS. FILE, SECOND. ID 
PERFORM "SELECT CASE WITH PATIENT. NUMBER = PATIENT. NUI-SER 

IF ©RECCOUKT > 0 THEN ^-^^^-x, 

PERFORM "SELECT CASE WITH DESIGNATOR. CODE = 'C OR TviTr. DESIGN 
IF ©RECCOUNT > 0 THEN 

PERFORM "SELECT C^SE WITH RECEIVED . DATE GE 'l-i-91' AlTD WITH 
NEWLIST = " 
DONE = 0 
LOOP. 

READNEXT ID ELSE DONE = 1 
WHILE DONE « 0 

IF ID ^ ILM. ID [1,9) THEN 
NEWLIST<-1> = ID 

END 
REPEAT 
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COT = COUN•l•^NEWLiST,®F^i^ iw-wi^xoi T- i 
FOR I TO FM.CNT 

C "^L MEWLIST<I> II. 9] 

READ C.REC FROM C.?ILE,C.ID THEN 

LOCATE 29 IN C.RECcl6> USING ©n/M SETTING ?0S THTN 
0NE.LIST<-1> = Ni:WLISTcI> 

^^oStE 21 in C.REC <:16> USING ©VM SETTING ?0S THEN 
ONE-LIST<-l> = NEWLIST<I> 
END 
END 
END 
ICEXT I 

;^3N0RMAL.LIST = " f nKT? t tct « 

TI^.CNT = COUNT (ONE. LIST, ©FM) (ONE. LIST tt ) 

'^OR I = 1 TO FM.CNT 

ABNORMAL. LIST<I> = ONE.LIST<I> 

NEXT I 

^^cS£^^ . LlS^ , ^NORMAL . LI^^^ 

PERFORM -MERGE CD I. GREY. ES" 
END 
END 
END 
END 

PCPERFORM "ENDCAP" 
END 

NEW. FILE. NAME - FILE. NAME 

*1?pJS5rS'^^Pv'-^N^'S?H:FILE.NAME:- «: KEW . PATH : KZW . FILZ . NAME 

END 
END 

OSCLOSE ILM.FILE 
END 

ANSWER<1> « COUNT 
ANSWER<2> - ILM.ID 
REPEAT 
HHPEAT 
?.ETURN ANSWER 
» 

SST . STRING : 
20S.FLAG - I 

|c»p'"nNTIL (EOS. FLAG - 0 OR I - f^C-LEN*!)^ . 0 

IF SEQ(FILS-RECtRSC.P0S4-I-l,ll ) - 0 TSEN EOS.eUHv, 

"^ILMrSEcIlS.OTM> - FIL2.SEC(REC.?0S*I-1,1] 

^l5'?lc<ILM.KUM> - ILM.REC<ILM.NOM>:FILE.REC[REC.POS.I.1.X} 

END 
I -I-- 1 
RSPEAT 
•.ETDRN 

3ET . D0U3LE ; . 
EOS. FLAG 

3YTE1 = SEQ(FILE.?J:C(R£C.P0S^7 in 
3YTE1 « FMTlOCONV(ByTEl,''MB") , RCO).B ) 
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3'rrZ2 = FMT (OCONV {3yTE2 , "K3 " ) , ' ?i (0 ) #8 ' ) 

3VT£3 = S£Q(riLE.REC(R=:C.^OS-5-5,l) ) 
3VT£3 « rMT{OCONV(BYTE3, "MB") , 'R(0)^8') 

» £2Q{FILE.REC[R2:c.POS4.4. 1] ) 
3rrE4 « FMT(OCONV(BYTE4, "K3") , 'R(0}S8' ) 

3VTE5 = SEQ(FILE.REC[Ri:C.P0S+3, 1) ) 
5YTE5 FMT(OCONV13YTZS,"MB«),'R(0)#8') 

3YTE^ = SEQ{FILE.RSC[11SC.P0S+2,1) ) 
3YTE5 = FMT(OCONV(BYTEe, "MB") , 'RI0)#8') 

3YTE7 = SEQ(FILE.RECtREC.POS + l,l] ) 
3YTE7 = FMT(OCONV{BYTE7, "MB") , 'R(0)#8') 

3VTE8 = SSQ(FILE.REC[REC.POS, 1) ) 

3YTE8 « FMT(OCONV(3YTS8, "M3") , 'R(0)4i8' ) 

SIGN = BYTEl[l,Xl ^ 
BIASED. EXP ^ ICONV(ByTElt2,7j :ByTE2(l,4)."MB-) -1023 
MJ^jrriSSA » "l":BYTE2[5,4) :BrrE3;BYTE4:BYTE5:BYTE6:BYTE7:ByTE8 
MANTISSA = BYTE2[5,4] :ByTE3:ByTE4:BYTE5:BYTE6:BYTE7:BYTE8 

MAN. SUM - 0 
MULTIPLIER « 1.0 
^OR I ■ 1 TO 54 

IP MANTISSAlI,!) « 1 THEN MAN. SUM MANTISSA [1 , 1) ♦MULTIPLIER 
MULTIPLIER » MULTIPLIER/2 
KSXT I 

ILM.REC<ILM.NUM> « ( (-1) ♦*SIGN) M2**SIASED.EXP) MMAN.SUM) 
JTTURN 

!ET. INTEGER: 
EOS. FLAG 1 
I • 1 

LOOP UNTIL (I « REC.LEN+1) 
IF I « 1 TKSN 

1LM.REC<ILM.NUM> « SEQ (FILE. REC [REC .POS+I-1, IJ ) 
END ELSE 

ILM.REC<ILM.NUM> « ILM-REC<IU4.NUM>+SEQ(FILE.REC [REC.POS+I-1, U) ♦256 
END 
I 1 
P3PSAT 

•^TxniN 

:ST. GEAR. ARRAY: 

?OR I.C « 1 TO REC.LEN 

ILM.REC<ILM.NUM,I-C> « FILE .REC fREC. POS-f I . C- 1 , 1] 

^TEXT I.C 
-ETuRN 

lET . FLOAT . ARRAY : 

' THIS COULD fE MESSED UP, NOT CHECKED 
FOR I = 1 TO REC.LEN 

3YTE1 « SEQ(FILE.REC[REC.P0S + 3,13 ) 
BYTEl = FMT(OCONV(BYTEl, "MB") ,'R(0)«B') 
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3'h:Z2 o FOT (OCOlyV (BYTrJ K3 " ) , ' 0 ) #c ' ) 

3YTr3 = FOT(0C0KV{BYT£3, "MB") , •R(0)#S' ) 

3VTE4 - SiQ(?ILE.RSC(R£C.?OS,i: ) 

3YT24 « rMT(0C0NV(BYTE4, "MB"J , /R(0)#£') 

P-EC.POS ^« 4 
SIGN « 3YTEXU,1) 

BIASED, EX? « IC0NV(ByTSl[2,7) :3YTE2[1,1) .-M3*)-127 
MANTISSA « »1'':SYTE2[2,7I :SYTE3:ByTE4 

MANTISSA « BYTE2 12,7) :BYTS3:3YTE4 
MAN. SUM - 0 
MULTIPLIER « 1.0 
FOR J nr 1 TO 25 

I? KANTISSAU,!) « 1 THEN MAN. SUM MANTISSA (J, 1 J ^MULTIPLIER 
MULTIPLIER « MULTIPLIER/ 2 
NEXT J 

ILM.RSC<ILM.NUM.I> « OCX)NV ( ( (-1) •♦SIGN) M2* ♦BIASED. EXP) ♦ (MAN. SUM) ♦100. 'MD2' } 
NEXT I 
STORK 

:T . FLOAT : 

THIS COULD 3E MESSED UP, NOT CHECKED 
3YTE1 - SEQ(FILE.R£C(REC.POS-i-3,l) ) 
3YTE1 - FMT(OCONV(3rrEl,"MB"),'R(0)#8') 

3YTE2 - SEQ(FILE.REC[REC.POS-i-2,lJ } 
3YTE2 m FMT(0C0NV(BVTE2, -MB-) . 'R(0)#B' ) 

3VTE3 « SEQ(FILE.REC(REC.POS+l,l) ) 
3YTE3 - FMT(0C0NV(BrrE3, -MB") , 'R(0)#8' ) 

3VTE4 • SE0JFILE.R£ClREC.POS,lJ) 

3rrE4 « FMT10C0NV(3YTE4,-M3'') . 'R<0)#8') 

r^EC.POS +-4 
SIGN « BrrElIl,!) 

BIASED. EXP - IC0NV(BYTEH2,7] :3VTE2[l.ll,"MB")rl27 
MANTISSA - "1- :BYTE2 (2,7] :BYTE3:3rrS4 

MANTISSA - BYT£2C2,71 :BrrE3:3YTE4 
KAN. SUM - 0 
MULTIPLIER - 1.0 
FOR J - 1 TO 25 

IF MANTISSAtJ,!) « 1 THEN MAN.SUl'l +» MANTISSA (J. 1] *MULTI?LIER 

MULTIPLIER - MULTIPLIER/2 
ICEXT J 

ILM.REC<ILM.NUM> « OCONV( ({ -1) ♦♦SIGN) ♦ (2^*3tASSD .EXP) ♦ (MAN. SUM) ♦lOO, 'MD2 ' ) 
SrURN 

tr . LONG . ARRAY : 
FOR I - 0 TO REC.LEN-1 

LONG. SUM « SEQ(FILE.REC(REC.P0S + I'4. 1] ) t 2 56*SEQ(FILE.REC {REC.P0S+I*4-^1, 13 ) 
LONG. SUM ^ LONG. SUM ^ €553 6 'SEQ (FILE. REC (REC. POS* 1*4*2 , 1) ) * 1^777216 ♦SEQ (FI 
ILM.REC<i1m.NUM, I'»-l> » LONG.SUT-I 
NEXT : 
^Tw?J^ 
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LONG. SUM = LONG.S-JI-I ^ 6S-5J6*SEQ (FILE.R£C(REC.?0S^2, 1) ) t 1S77721S *SE0 (FILE .R 
II*M.R£C<ILM.NUM> « LONG. M 
^TURN 

•rST.INT.ARRAV : 

■ FOR r.GIA = 0 TO RSCLiN-X 

ILM.R£C<ILM.NU!-M.GIA+1> - S2Q (FILE .R£C IRSC. POS + I . GIA*2 , i] ) ^ 256*SEQ(FILS 
J7EXT I.GIA 
ISTORN 

ani, EXPORT: 

OPEN 'DOS' TO D.FILE ELSE STOP 
MNR.PATH « 'F:\DSERS\MCM\PFS\' 
1<NR* FILE. NAME » " 
CR « CHARC13) 
LF « CHAR(IO) 
D.REC « " • 

MNR. FILE. NAME = ILM. ID(4 , 7J » .TXT" 
D.REC « ILM.ID:CR:Lr 
FOR I « 1 TO ILM.Ri:C<27> 

> D.REC :« FMT(IKT(100*ILM.REC<96,I>/5.4704+.5) , 'MD2') : : F.KT (ILM.R£C<99 , 1 

NEXT I 

WRITE D.REC TO D. FILE, MNR. PATH: MNR. FILE. NAME 
LETTORN 
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The following is the Microsoft Excel vS.O source code for the macro program used to 
convert the JVB vLO database files as set forth in Example III. 
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• JVB Macro 

• Macro recorded 6/24/94 by Craig Miller 

• Keyboard Shortcut: Ctrl+j 

Sub JVBO 
OpenLine : 

MsgBox -Open the ILM Morphometry ♦.CSV file which you wish to separate, or end the macro by 
selecting the END. CSV file" 
ChDrive -C* 

ChDir "CAlKAGING.JVBSriLESXSTUDY.ILM- 
' OpenFiie • Application. CetOpenFilenameC "Text C.csv), •.=sv") 
Hor)cbooks . Open Filename :«OpenPile 
FileLength » Len (OpenFile) 
SearchStr » "MX" 

Period « InStr (OpenFile . SearchStr) 
OpcnFilel c Mid (OpenFile. Period + 2) 
FileLengthl » Len(OpenFilel) 
SearchStrl = " . " 

Periodl « InStr (OpenFilei . SearchStrl) 
OpenFile2 « Left (OpenFilel . Periodl - 1) 
CASENUM « Range (*'B2'*) .value 
If there are no values in the file, end the macro. If data is present, proceed 
Zf CJiSEUm « Then GoTo LastLine Else GoTo FormatLine 
Forma tLine : 

Application.ScreenUpdating ■ False 
Cells, Select 
Kith Selection 

.KorirontalAlignment « xlCcnter 

.VerticalAlignment - xlBottom 

.WrapTcxt - False 

.Orientation « xlKori^ontal 
End With 

Columns ("A: A") .Select 

Selection. Delete Shif t :«xlToLef t 

Columns ( * I : I " J . Select 

Selection. Cut 

Columns.CB.B") .Select 

Selection . Insert Shif t : exlToRight 

Columns ("0:0") .Select 

Selection .Cut 

Columns (*'AL:AL») .Select 

Selection . Insert Shift : -xlToRight 

Columns ("FiM") .Select 

Select ion. Cut 

Columns ("AL:AL") .Select 

Selection. Insert Shif t : exlToRight 

Columns ("AH:AH") .Select 

Selection. Cut 

Columns ("AB:AB"} .Select 

Selection . Insert Shift : «xlToRight 

Columns ("A: A") .Select 

Select ion. Number Format - "O" 

Columns ("B:AK") .Select 

Selection. Number Format « "0.000000* 

Selection . EntireColumn . AutoFit 

Columns ("E;E") .Number Format = "O.OO" 

Columns ("AB :AB" ) .Number Format « "0" 

Columns ( "AD: AD" ) .Number Format « "O.OO" 

Columns ("AEiAE") .Number Format " "0.00" 

Range <"A2») .Select 

Application.ScreenUpdating « True 

ChDrive "F" 

ChDir '•F:\USERS\MCM\40X'' 
Kith ActiveWorkbook 
.Title « 
.Subject e 

.Author = "Craig Miller" 
. Keywords » - " 
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.Commencs = ** 
End With 

Activewor)cboo)c. SaveAs Filename : =:OpenFile2 , FileFormat ; =xlNorn;al 
Password:*"-, WriteResPassword : s " " , ReadOnlyRecommended : « 
False , CreaceBackup : eFalse 
Application. Sc re enUpdacing « False 
CASENUM s Range ( '*A2 } .Value 
While CASEKUM <> 

Application. ScreenUpda ting c False 
Select all cells pertaining to this particular case 

y - 3 

While Cells (y, 1) « CASENUM 
y « y + 1 

Wend 

Select the cells for one case, copy them to a blank JVB template, 
and delete the first and last row (blank rows from template) , 
Range (Cells (2, 1), Cells (y - 1, 37)). Select 
Selection. Copy 

Workbooks .Open Filename : «"C: \CRAIG\ MACROS \JVBCMP.B1JC" 
Selection. Insert Shift : exlDown 
With Selection. Font 

.Name « "Times New Roman" 

. Fonts tyle ■ "Regular" 

.Size m 10 

. S trike through e False . 

.Superscript e False 

.Subscript « False 

.OutlineFont c False 

.Shadow « False 

.Underline k xlHone 

.Colorlndex « xlAutomatic 
End With 
With Selection 

.Horizontal Alignment - xlCenter 

. VerticalAlignment « xlBottom 

.WrapText « False 

.Orientation « xlHorizontal 
End With 

With Selection. Borders (xlLeft) 

.Weight « xlThin 

.Colorlndex « xlAutomacic 
End With 

With Selection . Borders (xlRight) 

.Weight » xlThin 

.Colorlndex - xlAutomatic 
End With 

With Selection . Borders (xlTop) 

.Weight « xlThin 

.Colorlndex « xlAutomatic 
End With 

With Selection. Borders (xlBottom) 

.Weight * xlThin 

.Colorlndex « xlAutomatic 
End With 

Selection . BorderAround LineStyle : sxlKone 
Columns ( "A: AK" ) , EntireColumn. AutoFit 
Rows (-3 : 3") .Select 
Application. Cut CopyMode - False 
Selection. Delete Shift :exlUp 

y « ^ 

While Cells (y, 1) « CASENUM 

y - y + 1 

Wend 

Range (Cells (y, 1), Cells (y, 37)). Select 
Selection -Delete Shift :=xlUp 
Make the first column the ceil number 
Range {"Al") .Select 
ActiveCell.PormulaRlCl = "Cell" 
Range ("A3") .Select 
ActiveCell.FortnulaRlCl » "1" 
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R£nge ( - ) . Select 

ActiveCell.FormulaRaCl = "2" 

Set Source o Range (Cells ( 3 , l) , Cells (4. 1)) 
Set FILL * Range (Cells n, 1), Cells (y - 1, 1)) 

Source -AutoFill destination : =FILL 

Sheets ( " JVBCMP" ) . Select 

CASENUM t: Trim(CASENUM) 

Sheets ("JVBCMP") .Name = CASENUM 

Range ("Al") .Select 

ChDir "F:\USERS\MCM\40X- 
With Activeworkbook 
.Title = 
.Subject « 

.Author e "Craig Miller" 
. Keywords ■ " 
.Comments » 
End With 

Activeworkbook. SaveAs Filename CASENUM, FileFormat : exlNormal . 

Password : B , WriteResPassword.*" " ** « ReadOnlyRecommended: b * 
False, CreateBackup:eFalse 

* Select the statistical Calculations, paste them to a blank spreadsheet, _ 

and cut and copy all of the values to a single row. " 
Range (Cells <y ••■2, 1), Cells (y + 37) ). Select 
Selection. Copy 

Selection. Pas teSpecial Paste :«xlValues, Operation :»xlHone, _ 
SkipBlanks : ^False, Transpose : «False 
Selection. Cut 

Workbooks .Add Template : •"Workbook" 

ActiveSheet .Paste ' 

Range ("Al"} .Formula « CASENUM 

Range ( "B2 ; AK2 " ) . Select 

Selection. Cut 

Range ("ALl") .Select 

ActiveSheet - Paste 

Range ("B3 :AK3"> .Select ' ' 

Selection. Cut 

Range ( "BVi - ) . Select 

ActiveSheet . Paste 

Range (**B4:AK4*') .Select 

Selection. Cut 

Range ("DFl") .Select 

ActiveSheet , Paste 

Range (-BS:AK5*) .Select 

Selection. Cut 

Range ( "EPl - ) . Select 

ActiveSheet . Paste 

Rows ("1:1") .Select 

Selection. Copy 

• Paste the statistics for the case into the summary file 

Workbooks . Open Filename : »"F : \USERS\MCM\4 0X\LATTIME. JVB*' 
Selection. Insert Shift : =xlDown 
Application. Cut CopyMode « False 
AC tive Workbook . Save 
Act iveWindow . close 
AC tive Workbook. Saved = True 
Act iveWindow. Close 
Ac tive Workbook. Saved <> True 
Act iveWindow . close 
Selection. Delete Shift :»xlUp 
Range CAS") .Select 
Application. ScreenUpdating ^ True 
•Continue looping the macro until all of the cases contained in this vector 
file have been separated and no data is left in the vector file. 
CASENUM n Range ("A2-) .Value 

Wend 

•If there is more than one vector file to separate, open the next file 

Ac t i ve v>'o rkbook . Saved « True 

Act iveWindow. Close 

GoTo OpenLine 
' End the nacro and refresh the screen 
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Active Workbook .Saved ■ True 

Accivewindow. Close 

Appli cation. ScreenUpdaring = True 
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The following is the source code listing of a computer program for the neural network 
as set forth in Example XII. 
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rile : 


sxgs in .c 




Concents : 


aack Propagation neural necwor)cs wich 
sigmoid, sinusoid and gaussian accivacion 




function. Data 


nomialized by mean and scd. 


Dace : 


Dec, X 1 0 i. 7 92 




7 i.ncluae 


<inac>h . h> 




FLXiclude 


<stdio . h> 






TRUE 


1 


#de£ine 




0 


rdef Ine 


EQUAL 


0 


rdef ine 




X 


X M ^ £ z 

raezxne 


UNSAKPL:uD 


0 


T define 


SINUSOID 


0 


rdefine 


SIGMOID 


1 


T define 


GAUSS I>^ 


2 


faexxne 


LINEAR 


3 








vdecxne 


RESUM 


0 


^define 


SIG(x) 


1.0/(1.0 + exp( (double) (-x/ 10.0) ) ) 


r define 


SIN(f, X) 


sin( (double) (f *x) ) 


vaecxne 


GAUS (X) 


exp( (double) (*x*x/40.0)) 




SLOPE (f, x) 


cos ( (doisble) (f *x) ) 


^define 


MaYPtns 


1000 /*Max training + cesc patterns*/ 


#def ine 


MaxLayerK 


2 /*M&x LayerK in configuration*/ 


#define 


HaxWtM 


40 /*M&x weights per nuron*/ 


^define 


M&xStrLen 


100 /•Marimuxa String Length*/ 


*def ine 


MaxInputK 


20 /•Maximua data input lines*/ 


#def ine 


NxtLn(s, rile) 


fgets ( (char*)s,KaxStrLen, (FILE*) File) 



/»Networ)c configuration 
and control parameters*/ 



int 
int 
int 
int 
int 
int 
int 
float 
float 
float 
float 
float 
int 
int 
int 
char 
char 
char 
char 

/*Inouc 

float 

float 

float 

float 

float 

float 

char 

Char 



TrainPtns - 1$; 
TestPtns - 16; 
InpuCH - 4; 
OutputN - 1; 
LayerN - 2; 

KeuronlnLayer (KaxLayerN} - (B, 
ActlnLayer (MaxLayerK) - {1^ 1)< 
LeamRate - 0.5; 
Threshold - 0.01; 
Frequency - 3.0; 
InitialRange - 2.0; 
InputScale - 1.0; 
RptRate « 20; 
MaxLoops - 2000; 
RandSeed - 12345678 9; 
TrainFile [20] - "Traing"; 
TestFilefZO] - "Ttest"; 
ReportrilcC20) - "Report"; 
Networ)crile[20) - "Network"; 



data*/ 

*DataIn (MaxPtns) ; 
•DataOut [MaxPtns] ; 
mean ^axInputK) ; 
std(MaxInputN] ; 
•Input Layer; 
'Target; 

Ti^ainPref ix (MaxStrLen) 
TestPref ix (MaxStrLen ) • 



11. 



/•Total training patterns*/ 

/•Total test patterns*/ 

/•Total input nurons*/ 

/•Total output nurons*/ 

/•Total LayerN*/ 

/•Total nurons in each layer*/ 

/•Action function confige*/ 

/*Leaming rate*/ 

/*Terziiinati.on condition*/ 

/*Sinusoid frequency*/ 

/* Initial range for network*/ 

/•Input Normlizacion Factor*/ 

/•Reprot rate^/ 

/•Maxinum loops*/ 

/•Random seed*/ 

/*Traingin data file narxe*/ 

/♦Test data file najoe*/ 

/♦Output dowenload file*/ 

/*rinal NH sctruccurc*/ 



/•Normalized training patterns*/ 
/•Normalized training patterns*/ 
/•Input data mean buffer*/ 
/•Input data std buffer*/ 
/•Array of input nurons*/ 



/cdi-data2/" ;/ •Train data file prefix*/ 
. /cdi-data2/";/'-Tesc data file prefix*/ 
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rypedcf Swruct Neuron { 

Wt(MaxWtN); 
floa. 3iase; 
float Sum; 
fioac Out; 
float Error; 
float Delta; 
int Activation; 
) NEURON; 

KZURON *Layer {KaxLayerN ] ; 

char •'KELP -s "Usage: sigsin NewExpFlag . Nn" ; 

main (argc^ argv) 

int argc; 

bhtr *»argv; 
I 

Int Loops, Pattern; 

int Spool (KaxPtns] , RandSaanple; 

float SnmF.rror; 

double drenci<8 0 ; 

'^oid Initialization () ; 

void Resume ); 

void Output (}; 

void Bac^cPropagation (} ; 

void DumpOutput ( ) ; 



if C(argc«-1) | I (atoi (argv(l] ) — NEHEXP) ) 

Initialization ( ) ; 
else if (atoi(argv{l}> — RESTJM) 

ResuaO; 

el5e( 

printf ("%s\n"r HELP); 

exit(O); 

) 

Loops • 0; 

SuaError - (float) HUGE; 

while ( (Loops -H- <- HaxLoops) 4t .(SumError >- T^ireshold) ) ( 

f or (Pattcm-0; Pattern" < TrainPtns; Pattern-M-) /'KarJc as imsairpled* / 
Spool [Pattern) - UNSAMPLED; 

SuiaEcror " 0.0; 

£or(Pattem • 0; Pattern < TreinPtns; Pattem-M-) { 

RandSeaole » TrainPtns * drand48(); /•Randoaay sample patns*/ 

while (Spool (RajidSanple 3 — SAMPLED) 

RindSaaple - TrainPtns • drand48 (); /•Sampled? try other*/ 
Output (RandSaople) ; 
Bac>cPropagation ( ) ; 

DumpOutput (Pattern/ Loops, RandSample, £ SuaError) ; 

Spool fRandSampie) - SAMPLED; /-K&rked as seispied*/ 

1 

1 

if CTestPtns) ( 

SumError « 0.0; 

for (Pattern « TrainPtns; Pattern < TestPtns + TrainPtns; Pattem-H-) ( 
Output (Pattern) ; 

DumpOutput (Pattern, -1, Pattern, i SumError I ; 

) 

) 

exit 0 ; 
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Routine: Initiaiixation ( ) 

Get ir. argixments, allocate -sniDry, 

void Initialization () 
{ 

void ReportAx? ( ) ; 

void GetArgumenrs (} ; 

void XllocateMemory 0 ; 

void GctData () ; 

void sr&nd<8 () ; 

void InitialHetwoEX ( ) ; 

void NosoData ( ) ; 



GetArg^iments ( ) ; 

ReportArg (KEWEXP ) ; 
.XllocateMemory ( ) ; 
'GetDataO; 

Hoz3nData ( ) ; 

sraAd46 (RandSeed) ; 

Xniti&lKetworkO ; 

) 



Routine : GetArguznents ( ) 

Collect control and network 
configuration parameters . 



void GetArguments ( ) 
{ 

int ir 'iP/ tjnp; 

char TnoC, TmpS[X001; 

FILE •XrgFile; 
void ReadXnStrO; 



if ((XrgFile - fopenCarg", "r")) — NUXL) ( 

printf ("cannot open file arg to read\n") ; 

exit(l); 

) 

while ( (Tape - getc (XrgFile) ) ! - EOF) ( 
ungecc (IiopC, XrgFile ) ; 
Re&dZzi5tr(TinpSr XrgFile); 

if (srrc3np(TBpSr "Training Data File ;") —EQUAL) 

. ReadlnStr (TrainFile, XrgFile); 
else if (strcxnpCTmpS, "Test Data File: -) —EQUAL) 

ReadlnStr (TestFile, XrgFile); 
else if (strciio(TrnpS, "Report Data File —EQUAL) 

ReadlnStr (ReportFile, XrgFile) ; 
els^ if (strcixo(TinpS, ••Networ)c Structure File : ") — SQUXL) 

ReadlnStr (Networ>;rile. XrgFile) ; 
else if (strcma(TmpS, "Training Patterns EQUAL) i 

f scant (XrgFile, "^d", &TrainPtns) ; 

NxtLn(TmpS, XrgFile) ; 
J 
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else ir (St rem? (TnpS, "Test Patterns —EQUAL) i 
fscanff T^'i^' "^d", STestPtnsJ ; 
. KxcLn (1 3, ' gFile) ; 
) 

else if (strcn» (TmpS, "Input Neurons —EQUAL) { 
fscank (Argrile, "^d", filnputM) ; 
NxtLn (Taps, ArgFile) ; 
) 

else if Istrcmp (TnipS/ "Output Neurons :•*) -^-EQUAL) 1 
fsc&nf (ArgFile, "%cl", iOutputW) ; 
NxtI-n(TnpS, ArgFile); 
) 

else if (5trcxnp(TnipS, "Layers —EOaAL) I 
fscaJv£(AxgFile, "%cl", tLaycrN) ; 
KJCCLn(TinpS, ArgFile); 
J 

else if (strc3tip(TinpS, "Neurons in Lawyer:*) ^^ZQVtJ.) i 
£or(i-0; i<LayerN; i++) 

f sc&nf (ArgFile, " tNeuronlnLeyer (i] ) ; 
Hx&Ln (TxnpS , ArgFile) ; 
) 

else if (strcmpCTnipS, "Activation in Layer :") •—EQUAL) ( 
for(i-0; i<LayerK; i-H-) 

fscanf (ArgFile, "%d"r tActlnLaycr [i) ) ; 
Nxtl-n (TmpS, ArgFile) ; 
) 

else if (strcinD(TnipS, "Learning Rate:")— EQOAL J ( 
fscaxii (ArgFile, "%f", fcLearxiRate) ; 
Nxtl-nCTmpS, ArgFile); 
) 

else if (strcinp(TinpS, "Random Seed: EQUAL) { 
fscanf (ArgFile, "%d", tRaadSeed) ; 
NxtLn (TxnpS, ArgFile); 
) 

else if (strcrip(TmpS, "Input Scaling: ") —EQUAL) ( 
fscan'f (ArgFile, "%f", t Input S ca le ) ; 
if c Input Sc&le—0 .0)1 

printf ("Warning : 0, scaling factorXn"); 

exit (0) ; 

) 

NxtLn (TnoS, ArgFile) ; 
) ' . 

else if (strciap(TMS, "Ending Threshold:")— EQUAL) ( 
fscanf (ArgFile, " Vf " , ^Threshold) ; 
NxtLn (Tx&pS, ArgFile); 
} 

else if ( strcBip (TapS , "Frequency : " ) —EQUAL) ( 
fscanf (ArgFile, "%f", t Frequency) ; 
NxtLn ( Tsu>S r ArgFile ) ; 
} 

else if (strcmp(TfflpS, "Initial Range: ") —EQUAL) ( 
fsc£nf (ArgFile, "%f", tIniti&lRaage) ; 
KxtLn (TsdS, ArgFile) ; 
} 

else if (strcmp(TnipS^ "Report Rate :") —EQUAL) ( 
fscanf (ArgFile, "%d", ARptRate); 
NxtLn (Taps, ArgFile) ; 
) 

else if (strcxt^(TmpS, "Maximum Loops :") —EQUAL) { 
fscanf (ArgFile, "%d", tMaxLoops) ; 
^ NxtLn (TinpS, ArgFile); 
I 

1 

f close (ArgFile) ; 
I 
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r.oucrne: Re&dXnScr 

Redd into string, disgard newline character 

• / 

void Rc&dlnStr (Str, InFile) 
ch-ar 'Str; 
rXl*i •Incile; 
( 

int Len; 

frets tstr, KaxStrLen, InFile); 

T(Str + strlen(Str) - 1) - 'NO'; /*Disgard newline and p&d null*/ 

) 

Routine: ReportArg 

'p Reporte the input argiunent3. 

«<r*»*»«, 

"c-oid ReporcAxgCNewExpFiag) 
iAt NewCxDFlag; 
{ ' 
register i, »ip; 

riLE »R?tFile; 

Lt CNewExpFlag) ( 

if < (Rptrile-f open (ReportFile, "w" ) ) —NULL) I 

printf ("cannot open file %s to write\n", ReportFile) ; 

exit(l) ; 

} 

) 

else( 

i£ ( (?totFile-f open (ReportFile, "a")) —NULL) I 

printf ("cannot .open file %s to appendXn", ReportFile); 

exit (1) ; 

1 

) 

if (!KewCxpFlag) 

fpriJitf (RptFile/ "\nRESUMED FROM THE PREVIOUSLY STOPED PROCESS NnSn" ) ; 
f print f (RptFiler "Training Data File: %s\n"/ TrainFile) ; 
fprintf (RptFiler "Test Data File: ^sXn**, TestFilc) ; 
fprint£(RptFile, "Report Data File: %5\n"r ReportFile); 
fprintf (Rptrile, "Network Structure File: %s\n'', NetworXFile) ; 
fprintf (Rptrile, "Training Patterns : %d\n", TrainPtns) ; 
fprintf (RptFile, "Test Patterns: %d\n", TestPtns); 
fprintf (Rptrile, "Input Neurons: %d\n", InputK) ; 
rpriirtf (Rptrile, "Output Neurons: td\n", OutputN) ; 
f?rij:t£{RptFile, "Layers: %d\n", LayerN) ; 
f?rint£(RptFile, "Neurons in Layer: ") ; 

ip * NeuronlnLayer; 
for(i«0;. i<lAyerN; i++) 

f print f (RptFile, "%d\t", •ip+-+) ; 

fprintf (Rptiiie, "\n") ; 

f print f (Rpt File, "Activation in Layer: "J; 
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•0; KLeyerN; i-i--*-) 

fprintf (Rptrile, d\*?, •ip-r+) ; 



iprintf (RptFile, "\n") ; 

fprir.rf (RptFile, "Learn Rate: %f\n", Le&rnRate) ; 
fprir.ti (Rptrile, -Random Seed: %d\n", RandSeed) ; 
fprir.rf (Rptrile, "Input Scaling: %f\n", InputScale) ; 
fprir.if (Rptrile, "Ending Thresnoid: ^f\n". Threshold); 
rprini-f (Rptrile, Treouency: \n". Frequency) ; 
rprir.if (Rptrile, "Initial Range %f \n" , IniCialRange) ; 
f print f (Rptrile, "Report Rate %d\n"r RptRate) ; 
f print f CRptFile, "Maxiauxn Loops *d\n", HaxLoops); 

f close (Rprrile) ; 
} 



Routine : GetData 

Read in training and test data from file 

w .«...* .•.../ 



void 
( 

reg-ister 
float 
char 
FILE 



GecData ( } 



*fpl, *fp2r ft«p; 
♦TnipStr; 
*Datarile ; 



/•Get training data*/ 

TTKJStr (char *) strcat (TrainPref ix, TrainPile) ; 
if\CDatarile " f open (TmpStr, "r") ) ~ NULL) ( 

printf ("cannot open file %» to read\n", TnipStr) , 

exit (1) ; 

) 



fcr(i-0; KTrainPtns; i++) I 
fpl - Detain [i]; 
fp2 - DataOut (i] ; 
for(j-0; j<InputN; j-M-) ( 

fscanf (DetaFilc, "%f", 4f tnp) ; 

•fpl-*-+ « f tap /Input Scale; 

} 

for(j-0; j<OutputN; j++) 

fscanf (DataFile, "%f", f p2++) ; 

) 



/*<;et test data*/ 

r=pStr - (char •) strcat (TestPref ix, TestTile) ; 
if (TestPtns) { 

if ( (Dat.arile - f open (2it©Str, "r") ) — KULL) { 

priatf ("cannot open file %s to readXo"/ lapStr) ; 

exit (1) ; 

} 

for (i-TrainPtns; i<TrainPCns + TestPtns; i-M-) { 
fpl Dataln(i} ; 
fp2 « DataOut (i); 
for(j-0; j<InputH; j h^) ( 

fscanf (Datarile, "^f", iftmp) ; 

•fpl++ - f tn»/XnputScalc; 

I 

for(j-0; j<OutputN; 
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) 

) 

) 

Routine: NormDate 

Korrialize input data 
to range [0,1] . 

void NorznDataO 
( 

int j, pens; 

float ftmp; 

TllX •RptFile; 

pens • TrainPtns; 
for(i»0; i<InputN; { 

ftmp - 0.0; 

for(j-0; j<ptns; j-H-) 

ftmp +- Datalnl jl lil ; 

mean(i) « ftnip/ptns; 

} 

for(i-0; KinputN; i-H-) { 
ftxop * 0.0; 
for(j»0; j<ptns; j-H-) 

ftap -H* pow( (Dataln( j] [ij - mean (ij ) ^ 2.0) ; 
stdCi) - pow( (fuao/ptnsK O.S) ; 
1 

pens « TrainPtns + TestPtns; 
for(i-0; i<lnputN; i-H-) 

for{j»0; j<ptns; j-H-> 

Dataln[j)til - (Datain [ j] ( i] -n^an li) ) /std[i) ; 

if{{Rptrile - fopen<RepoctFiie, - a - ) ) ~Ntn*L) ( 

printf C^cannot open fiie ts to writeVn", ReportFile) ; 

exit (0) ; 

) 

f priritf (Rptrile, "MeanXtSTDXn") ; 
for(i«0; i<InputM; i++) [ 

f print f (RptFilcr "tfNf, xaean[il); 

fprintf (RptFile, "%f\n-, stdtil); 

printf (-%f\t-, nean(il); 

printf (••%f\n"r std[i3); 

) 

fprintf (KptFile, "\n\n") ; 

zprintf CRptFile, " Loops \tPtns\tNN-Out\tTarget\tSq~Err\n") ; 

f Close (Rptrile) ; 

) 

Routine : AliocateMemory 

Ai.locate memory to nuron 
structure a-nd other arra/s 

void XllocateHemory () 

I 
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-agister JL, 

/•^ Allocate space 
for training data*/ 

for(i*0; i<TrainPtns + TestPtns; i++) ( 

if ( (Datalntij « (f ioat • ) calloc ( InputN, sizeof (float ) ) )««»NUI.!-) I 
printf ("cannot allocate memory to DatalnXn") ; 
exit (1) ; 
) 

if ( (DataOut [i] - (float*) calloc (OutputN, sireof (float) )) ~KULL) { 
print f ("cannot allocate memory to DataOut \n" ) ; 
exit (1) ; 
} 

} 



/*Allocate space for 
ijiput layer nuroas*/ 

if ((Input Layer - (float •) calloc (InputN, sireof (f loat) ) > --WLL) ( 
printf ("cannot allocate memory to InputLayerVn"*) ; 
exit(l); 



/•Allocate space for nurons 
in the rest of Layers*/ 

for(i-0; i<LaycrN; i-H-) 

if ( (Layer (i)«{MEUKON •) calloc (NeuronlnLayerf i3r sizeof (KEUROH) ))— NULL) ( 
print f ("cannot allocate xaeaory to Layer (%dl \n"^ i) ; 
eatit (1) ; 
) 

/^Allocate space for target values*/ 

if ((Target • (float »> calloc (OutputNr sizeof (float) )) —NULL) ( 
printf (•'cannot allocate memory to ffargetXn") ; 
exit(l); 
) 

) 

/ • — 

Routine: InitialNetwor)c 

Initialize the activation function configuration for 
each layer (nuron) and nuron structure in t^e netvor)c. 



void InitialNetworkO 
{ 

register i, j, k; 

float ♦fp; 

NStJRON »NeuronP? 
double drand4e(); 

for(i«0; i<LayerN; i++) { 

for(NeuronP - Layer(i)^ j-0; j<NeuronInLayer (i) ; NeuronP++, j-M-) { 
^ NeuronP->Activation « ActlnLayer (i] ; 

NeuronP->Biase « (dxand48 0 * O.S) • InitialRange ; 
NeuronP->Suro - 0.0; 
NeuronP->Out =0.0; 
NeuronP->Error - 0.0; 
fp - NeuronP->wt; 
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*or(Jc-07 )c<InpucN; >c-r+) 

•£p'r+ « (drancS48() - 0.5) * iW»**^e; 

else 

for(k-0; k<NeuronInLayer ( ; )c-i-+) 

•fp++ - (drand^BO - O.S) * initialRange ; 



) 



Rout ine : Re adXuHe t wo r )c 



Initialize the activation function configuration for 
each layer (nuron) and read in the networh weight 
values from stoped process. 



void ReadlnSetwork () 
( 

^agister i, X; 

char Header, Ctn^; 

f loat •£p; 
KEUKON *HeuronP; 
FILE *NetFp; 

if ( (NctFp-fopen (Netvor>:File, "r"))— KULL) ( 

printf(« cannot open file file %s to read\n", Networ>:File) ; 

exit(l); 

1 

Header * 7 + InoutN; 

whiJ.e( — Header)! /*Read off networ)c file header*/ 

f scanf (NetFp, "^c'\ fiCtno) ; 
while (Ctiap!-' \n' ) 

f scanf (NetFp, "%c", tCtxnp) ; 

J 

forli-0; KLayerK: !■*-+) ( 

£or(NeuronP - Layer(i], j-0; j<NeuronInLayer [ ij ; NeuronP++, j-M-) ( 
NcuronP->Activation - ActlnLaycr (i] ; 
fscanf (NetFp, "%f", t (NeuronP->Bia5e) ) ; 
fp - NeuronP->Wt; 
if(i~0) 

for()c-0? )c<lnputN; )c++> 

f scanf (NetFp, "\f", f p+-r) ; 

else 

for()c-=0; >:<NeuronIiiLayer(i-13 ; >t++) 
f scanf (NetFp, f p-H-) ; 



) 

f close (NetFp) ; 
) 



Routine: ResumO 



Resum a stoped process. 
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void <^esuin() 
{ 

void ReportArgO; 

void GetXrg-uxnenLS ( > ; 

void AJLiocateMemory () ; 

void GetDatAO; 

•••oid IniciaiNetwork { ) ; 

void NomDAta{j; 

Get,ArgujtienLs () ; 
aeportArg (RTSCJM) ; 
AJ.locaceHeaory 0 ; 
GetDetaO; 
HocxDOata 0 ; 
Re&dInNetwo£)c 0 ; 
) 



Roucine : Ourpu^ 

Calculate the output for each nuron. 



void Output (Pattern) 

Int Pattern; 

[ 

int ir j; 

float mp, rz: 

KSURON •NeuronP ; 

void GetPattemO; 
float GetSuaO; 

GetPattem (Pattern) ; 

for(i-0; KL^yerS; i-M-) ( 

Neuron? - Layer (i); 

£orlj-0; j<NeuronInLayer(i] ; NeuronP-M-) { 

rt • GetSumCi, j) ; 
Txnp - rt + NeuronP*>Biase; 
NeuronP->Suxa - Trap; 
switch (NeuronP->Activacion) ( 
case SIGKOZD: 

MeuronP-X)ut " SIG(Tna>); 
break; 
case SIKUSOZD: 

NeuxonP*>Out « SIN (Frequency, Tap); 
bre&)c; 
case GAUSSIAN: 

NeuronP->Out « GAUS (Txnp) ; 
breaJc; 
case LIKEAR: 

MeuronP->Out « Tnp; 
break; 

default: 

printf ("invalide activation function type\n"). 
exit(O); 

) 

) 

) 

/• • • 

?.outine: GetPattem 
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w» 

void GetPattern (Pa-tern) 

int Pattern; 

( 

register i, j; 

float *tpl, "fpZ; 

£pl - XnputLayer; 
fp2 - Dat&In (Pattern] ; 
for(i-0; i<InputiN; i-r+) 
*fpl++ - •fp2-f+; 

fpl « Target; 
fp2 " DataOut (Pattern] ; 
for(i-0; KOutputN; i++) 
•fpl++ « •fp2++; 

J 

f(outine: GetSumO 

Calculate the sxim input for each nuron 



float Get.Suffl(LayerIndex, Neuronlndex) 
Inc Layerlndexr HeurorvXndex; 

( 

register i, j; 

float Sum, "fpl, •fp2; 

KZOROK *NeuronPl, *NeuronP2; 



if (LayerXndex — 0) { 

NeuronPl - (NTUROM •) (Layer (Layer Index] + Neuronlndex); 
fpl - InputLayer; 
fp2 - Neuroapi->Wt; 
for (Suxn«0 . 0^ i-0; i<InputN; i-*-*) 
Sua (•fpH-+ * •fp2'r+) ; 

) 

else{ 

NeuronPl - (KZUROH *) (Layer (Layer Index) + Neuronladex) ; 
NeuronP2 - Layer (Layer Index - 1); 
fpl - Neuronpi->Wt; 

for (Sxim-0 . Or i-0; iCNcuronlnLayer (Layer Index-1] ; i-H-, NeuronP2'f+) 
Sum +- (NeuroixP2->Out ' * •fpl-H^) ; 

) 

rerum(Sizm); 
) 

B-ourixie : BackPropgation 

Modify network through bachpropgation . 

,......../ 

void BackPropagation () 
( 

void GetDelta () ; 

void OpdateWt () ; 
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U?dat€Wt 0 ; 
I 

r. 

Roucine: GecDeita 

Calculate delta values for each nuron in the 
network, out layer and internal layer ; 



void GetDeltaO 
( 

register i; 

void OutLayerDelta () ; 

void InrerLayerDelta () ; 

OutLayerDeltaO ; 

^or (i-I*ayerH-2; i>~0; i — ) 
InterLayerDelta (i) ; 

) 



Routine: OurLayerDelta 

Calculate output layer delta 

void OutLayerDeltaO 
( 

register i; 
float TapOut; 
NEURON *NeuronP ; 

for(NeuronP - Layer (layerN-1 ) , i-0; iOJeuronlnLayer [LaycrN-l] ; NeuroRP+-f, i+-»-) ( 
TnoOut - NeuronP->Out; 
NeuronP->Error « Target (i] - TxnpOut; 
switch (NeuronP*>Activation) ( 
case SIGMOID: 

NeuronP->Delta TinpOut * (1 - TmpOut) • NeuronP->Error; 
break; 
case SINUSOID: 

NeuronP*>Delta -SLOPE (Frequency, NeuroaP->Su3i) * NeuronP-->Er 
break; 
case GAUSSIAN: 

NeuronP->Delta - -NeuronP->Suja « TnpOut • NeuronP->Error/20; 
break; 
case LINEAR: 

NeuronP->Delta « NeuronP->Error; 
break; 

default : 

^ printf Cinvalide activation function type\n") ; 

exit (0) ; 

) 

) 

I 
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?.out.ine: IncezLe D' "'a 

Calculate internal layer Delta 
- ^ 

void InterLayerDelta (Layerlndei:) 

Int LayerXndex; 

{ 

register i; 
float TmpOut; 
KiURON «NeuronP ; 

float NeuronError ( ) ; 

for (NeuronP-Layer I Layer Index) , i«0; i<NeuronInLayer[ Layer Index] ; NeuronP++, i+-r) ( 
TapOut - NeuronP->Out; 

NeuronP->Error - NeuronError (Layerlndex, i) ; 
switch (NeuronP->Activatioii) { 
case SIGMOID: 

NeuronP->Delta ^ TnpOut • (1 - TmpOut) * NeuronP->Error; 
bre&k; 
case SXKUSOXO: 

NeuronP->Delt.a -SZiOPE (Frequency , NeuronP->Sum) * MeuronP->Error; 
break; 
case GXassXAN: 

NeuronP->Delt.a « -2 * NeuroiiP->Suia * TmpOxiZ * NeuronP->Error; 
break; 
case LINEAR: 

NeuronP->Delt.a - NeuronP->Error; 
break; 

default: 

prlnrf ("invallde activation function typeSn"); 
exit: (0) ; 

) 

) 

) 

Houtiae: NeuronError 

Calculate error for each nuron in the internal LayerN. 

«.«»««ir*««*****«*«*««*^**»*i.«*«**«**»*»««* 

float NeuronError (LayerXndex, Neuroalndex) 

±jxz La ye r Index r Neuron Index; 

I 

register i; 
float Error; 
NEURON *HeuronP; 

Error "CO; 

for (NeuronP-LayerCLayerlndex+l] , i«0 ; i<NeuronXnLayer [Layer Index+1) ;NeuronP-M-, i-M-) 
Error +- (NeuronP->wt [Neuronlndex] •..NeuronP->Delta) ; 

reitum (Error) ; 
) 

.WW. **«*««*r WW. 

Routine: UpdateWt 

Modify the weights associated with each nuron. 



-144- 



wo 97/12247 



PCT/US96/15780 



vo id Upda t e wc { ) 

register i; 

void rirSwLayerWt ( ) ; 

void KesLL&yerWt () ; 

rirstLayerWt C) ; 

for(i"l; i<LayerM; i-r+) 
ResCLayerWt (i) ; 

) 

■loucine: First LayerWt 

Modify the weights reXated to the first layer nurons. 
./ 

-void FirstLayerWt 0 - 
( 

register i, j; 

floa t »fplr •fp2; 

NOTROM *NeuronP; 

forCNeuronP - LayerfO), i-0; i<NeuronIftLaycr (0) ; NeuronP++, i++) [ 
fpl « InputLayer; 
fp2 - NeuronP->Wt; 
for(j«0; j<InputN; j-M-) 

*fp2-t-+ LeamRate * NeuronP->Delta * •fpl++; 
NeuronP->Biase (LeamRate * NeuronP->Deita) ; 
\ 

} . 



/ «**«»»*«r« 

Routine: RestLayerWt 

Modify the weights related to the nurons of other LayerK. 

• 

void RestX«ayerHt (LayerXndex) 

{ 

register i^ j; 

float 'fp; 

KEORON *KeuronPl, *NeuronP2; 

for (NeuronPl-LayertLayerlndex) ,i-0;i<HeuronIaLayerCLayerIndex] ;NeuronPl4-r, i++) { 
fp - HeuronPl->Wt; 
KeuroaP2 - Layer [La ycrXndex 1} ; 
for(j-0; j<WeuronXaLayer[ Layer Xnder -1]; j-H-) { 

*fp++ +- (I*eamRate*NeuronPi->0eita*NeuronP2->OutJ ; 

NeuronP2++; 

1 

Neu^onPl->Biase (LeamRate * NeuronPl->Delta) ; 
) 

) 
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Rcucine: DuinpOut*^»'C 

Save output , report file £na rturn it to nu .1. 



VDid DuKipOutput (PtnCnt , Loops, ?tnSampled, SuiaSrror) 

float 'SumError; 

inr Loops, PtnCnt, PtnSamoled; 

{ 

register i; 

float SqError^ OutError, ♦fp; 

KEUROK *NeurorvP; 
rll-E •Rptrile; 
void DumpNetwor)c{) ; 

fp - DataOut(PtnSainpled) ; 
NeurozvP - Layer (LayerN-lJ ; 
ScError * 0•0; 
f or (i«0; KOutputN; fp-*-+r NeuronP++, i++) { 
OutError « *fp - NeuronP->Out; 
SqError +- Out Error •OutError ; 
J, ) 

'•SuiaError +- SqError; 

if ({Loops * RptRate — 0) || (Loops — -1) ) ( 

if ( (RptFile-f open (ReportFile. "a") ) —NULL) ( 

print f ("cannot open file %s to append. \n", ReportFile) ; 

exit(l); 

) 

fprintf CRptFile, "^dXttdNt", Loops, PtnSaiopied) ; 
NeuronP - Layer (La yerN-1) ; 

for (NeuronP-Layer (LayerN-ll , i-0; i<NeuronInLayer(LayerN-13 ;NeuronP-M-, i++) 
fprintf (Rptrile, -t6.4f\t". Neuron? ->Out ) ; 

fp - DataOut [PtnSaapled] ; 
for ( i-0 ; i<OutputN ; i++ ) 

fprintf (RptFile, "%6.<f\t-, *f?'HK) ; 

fprintf (KptFile, "%6-4f Vt", SqError) ; 

fprintf (RptFile, -\n*) ; 

f close (RptFiie) ; 

) 

if ( { (Loops %RptRate~0) ££ (PtnCnt— TrainPtns-l) ) I I (PtnCnt— TrainPtns+TestPtns-1) ) i 
if ( (RptFile-f open ( ReportFile . " a " ) ) —NULL) { 

pcintf ("cannot ooen file %s to aopendAn", ReoortFile) ; 

exit(l) ; 

} 

fprintf (RptFiie, "SunError %6.4f\n\n\n", •SumError) ; 
f close (RptFiie) ; 

DunoNetworkO ; 
) 

) 



Sloutine : DumpNetworJc () 



Save the final networlc configuration r including networ}c 
parameters (layers, layer configuration, activation function 
configuration), weights, biases. 
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void DunpNetworh ( ) 
( 

register i, j, k, WtK; 

float. *fp; 
N-EURON •Ncurcr-P; 
FILE •NetFiie; 

if ( (NetFile«£open (NetvorJcFile, "w") ) ~KULL) ( 

printf ("cannot open file hs to writeXn", NetworkFile) ; 

exit(l); 

} 

fprintf (NetFile, -%cl\ttd\n", InputN, OutpucN) ; 
fprinrf (NetFile, "WXn", LayerN) ; 
fpr(i«0; i<LayerN; i+-^) 

fprintf (NetFile, -%d\t", NeuronlnLayer [i)) ; 
f printf CNetFile, "\n") ; 
for(i-0; i<LayerN; i-H-) 

fprincf (NetFile, "%d\f , Act InLayer [i) ) ; 
fprintf (NetFile^ "Vnifi. 4f \n**r Frequency); 
tor(i-0; KlnputN; i++> ( 

^ fprintf (NetFile, "^fSf , meanri)) ; 

fprintf (NetFile r •'%f\n", scd(i]); 

1 

fprintf (NetFile r "VnXn") ; 

for (i-0;i<LayerN;i-M-) ( 

WtN - i~0 1 InputN : NeuronlnLayer (i-13 ; 

for(NeuronP-Layer [i] r j"0; j<NeuronInLayer(i] ;NeuronP-M-, ( 
fprintf (NetFile, -NntS . 4f \n", NeuronP->Biase) ; 
for (fp-NeiironP->Wt, )c-0; )t<WtN; k-M-) { 

fprintf (NetFile, "%6-<f •fp'H-) ; 

if()c%8— 7) 

fprintf (NetFile, " \n" ) ; 

) 

) 

) 

f close (NetFile) ; 
) 
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The following is ihe source code listing of the CellSheet* v. 1.0 computer program as 
set forth in Example XIII. 
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Cell Sheet vl.Od Source Code 



< CftllShftfttD Haero 

• Macro recorded 2/S/BS by Craig Millar 

I 

' Keyboard Shortcut: Ctrl4b 
f 

Sub CellSheetDO 

• Open a Call Shaat vl.od Betal ♦.CSV file from tba C:Mmag»g. JVB\rilM\study.ILM _ 

subdirectory only, otber visa you wust change the SearchStr value so that ^ 
it equals the last character in the nai&e of the subdirectory, ^ _ ^ ^ 

Msg • ■Bave you converted all of the listmode files using the Cell Sheet vl.Od Program and 
all of that programs available paraaiacers, and then escorted the data to a *.CSV file?" 
Style ■ vbYesNo 

Titla - 'Listmode Conversion Ststtu* 

Status m MsgBoxCKsg, Style, Title) 

XZ Status ■ vhKo ihen goto findLlna Else 

Msg m "Have you already created the fozmatted excel spreadsheet and you just need to create 
a stats filer " 

Style m vbYesllo ^ ^. 

Title • •create Only Stats Pile from Existing Spreadsheet?" 
StAtusl • MsgBox(Msg« Style, Title) 
Zf Statusl m vbYes Then GoTo BadStart Else 
OpsnXiinat 

MsgBox •open the ♦.CSV file which you with to convert:* 
CU)rive "C" 

ChDir •Ci\IMW5IHG.JVB\niiES\STnDY.IlM^ 

CSVFile - Application.aetOpenFilena»e{"TeJCC (••csv), ♦.csr") 
Z£ CSVFile - False Then Goto EndLlne Else 
Horkbooks.Opan Filename t»CSVFile 
pileX-ength - X«eo (CSVFile) 
SearchStr ■ "M\* 

Period - ZnStr (CSVFile. SearchStr) 
CCVFilel m Mid (CSVFile, Period ^ 2) 
FilaX«an9thl - XM(CSVFilal) 
SeanhStxl • ••• 

Perlodl - instrCCSVFUel. searchstrl) 
CSVFile! - Left(C5VFilel, PerdLodl - 1) 
casemn • Range CB!*} .Value 
If CaaeNum » •• Then GoTo LastLiTie Else 
PoreatJ^ine » . . 

• This part of the macro cuta and paaees the coluims of data «ito the correct order. 

FlleSoLtion ! SSplication.lmStBorC -Enter the directory in which you^jr«^ 
to be stoSd. Thls^SSt directory. Exssple: l:\AREV\MCaiSTEST*, -File Dxrectc 

ry«, "ZsVABTVXMCKXSSZZJaiT') 

DrlveLocatlan m Let t (FllaLocatlon, 1) e*.*^-^ e^^^e p^i-. a ^^i* 

StatsFlle - Application. ZaputBox< -Enter a name for the f «»«*jy sta^tistics ^t.^J^i^n^^S 

eh 1^ cStain all of Sc statistics for each case. Exsn^ae; TBSTSTAT% -Summary seatLlstlcs 

File Name") 

Application. ScreenDpdatlng « False 

No^ooks.Open Filename t--C: \CRAZO\MACROS\CSVIOSTA.BUC« 
CbDrlva DriveLoeatlOD 
QsDlr Fllatioeatlon 

With ActiveWorkbooic 

.Title - •• 

.Subject - ■• 

.Author - "Craig Miller^ 

• Keywords • •• 

ActiSoUkSolc.SaveAS Filename .-StatsFlle, ^iJ;^fSf^!4^!;i^ - 
Passwordi-*-, HrlteBesPassword:«-^, RaadOnlyBeconnianded:- . 

False, CraataBadnipivFalse 
ActiveHindow.Cl se 
Sheets (*Sheetl«) .Select 
Cells. Select 
With Selection 

.ttorisontalAllgnmene • xlCcnter 

• VerticalAlignment ■ xlBottom 

.WrapText - False 

.orientation • xlBorisontal -14 9- 
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Cell Sheet vLOd Source Code 



Eod With 

Coltimns ( "A : A- ) , Select 
Selection .Delete Slii£t:«xlTol<e£c 
ColutBos(**Z:Z") .Select 
Selection. Cut 
ColunsEW<"B;B«) .Select 
Selection. Insert SKiftr-xlToRight 
ColuansCopsD") .Select 
Selection. Cut 
Columns («AK:AK«) .Select 
Selection* Insert Shift : -xlToRicht 
ColumtuCFsK-) .select 
Selection. Cut 
Coluns ( 'AK; AK" ) . Select 
Selection . Insert Shift : »xlToRight 
ColUBU ( "AMsAK? ) • Select 
Selection. Cut 
ColUSBis ( "AA: AA" ) . Select 
Selection. Zuert Shift : KxlTORight 
ColWfis ( 'AEs AE" } . Select 
Selection. Cut 
Columns ( "AS tAB" ) . Select 
Selection. Insert Shift I'XlToRi^ht 
Coluams ("AiA") .select 
Selection. KumberForBat * "9" 
I Detexmines the nunber of rows in the spreadsheet . 
Range r«A2") .select 
c » 2 

Nhile Cells (c, 1) .Forwula <> »» 
c • c + 1 

Wend 

t Rovmds Colunsis B (Summed O.D.), E (Pg DUM , AD (xm Index), and A£ 

(Density) to two significant digits and Colunn C (Area) to five significant digits. 

f M 3 

Columns (f) .Select 

Selection, insert shift : lixlToRight 

Celled, f ) .Forraula - Ceils (1. f - l) .Value 

Cell8(2» f). Formulae "BROUnD(RC[-l] ,2) « 

Cell3(2, f) .AutoFill Destinationi-RangeCCella (2, f ) , Cells (c - f)), _ 

Type : «xlFlllDef suit 
Columns (f) .Select 
Selection. Copy 

Selection. PasteSpecial Paste : -xlValuea , Operation ;-xlKone, _ 
S)cipBlan)ce: -False, Transpose :«False " 
Columns (f - 1) .Select 
Application. cutcopyMode s False 
Selection. Delete Shift :«xlToX«eft 

f • 4 

Colisninsif) .Select 

Selection . Insert Shift : sxlToRight 

Cells (1» f ) .FoztDEuls . Calls (1, f - 1) .Value 

cells(2, f ) .Foxnuia • "«ncaNC(RC£-i] ,4) - 

Cell8(2, f) .AutoFill Destination: -Range (Cells (2, f ) , Cells(c - i, f ) ) , 

Type:-xlFillDefault 
Coluimis(f ) .Select 
Selection. Copy 

Selection. Past especial Paste : axlvalues . Operation :«xlNone. _ 

SUpBlanJcs : KFalse , Transpose i BFelse 
Columns (f - 1) .Select 
Application. CutCopyMode ■ False 
Selection. Delete Shift :>xlToIieft 

t m e 

Columns Cf) .Select 

Selection. Insert Sliif t :exXToRight: 

Cells (1* f) .Formula « Cells (X. £ • 1) .Value 

Cells(2, f) .Formula « "-ROUNDCRCC-l} , 2) " 

Cells(2, C) .AutoFill DefiCination:«Range (Cells (2. f } . CellsCc 

Type : -xlFilloef ault ^ 
Columns (f ) .Select 

Selection. Copy -.1«;n 
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^"ell Sheet vl.Od Source Co^ 



Selttcclon.PasceSpecial Pasce:»xlValueB, Operation : -xlHone , _ 
SkipBlazOcs: -False, TraxisposeieFalse " 
ColunmsCf * 1) .Select 
Application. CutCopyMode - False 
Select ion. delete Shift : exlToLeCt 
f - 31 

Coliunna(f) .Select 

Selection. Insert Shift : -xlToRight 

Cellfid, f). Formula « Celled^ t * 1) .Value 

Cell8(2, f).Foninaa - ->*R002ID(RC[-1] «2) « 

Cells (2, f 3 .AutoFlU De8tination:-lUui9e(Cells(2« f). Cells (c - 1. Z)) . 

Type : sxlFillDefault 
Coluaaxs(f) .Select 
Selection. Copy 

Selection. Pas teSpecial paste x axlV&lues ^ 0Pft>^<LCioa**3clKone, _ 

SklpBlanks 2 «False , Transpose ; -False 
Columns (f - x) .Select 
Application. CutCopyMode « False 
Selectioa.Oelete Shite t-xlToLeft 
f - 32 

Colximns (f ) .Select 

Selection. Insert Shift ; -xlToRight 

Cells (1, r) .Formula - cells (ir f * 1) .Value 

Cells(2, f) .Formula - »i.IU)Tn9D(M(-l] r2) " 

Cells(2, r) .AutoFill Destination: eftanga (Cells (2, t) , Cells{c - 1, t) ) , _ 

Type I -xlFlllDef ault 
Colunaie(f) .Select 
Selection • Copy 

Selection. Pastespeei&l Paste sBxlValueSi Operation :«xlNone« _ 

SklpBlanks: -False, Transpose: -False 
Coluims(f » 1} .Select 
Application. CutCopyMbde - False 
Selection.^elete Shif c 2 -xlToljeft 
Assigns the correct Cell Cl&sslficatlon # to each cell hased upon _ 
MCKl £Uter values (R&D Filter for CAS200) . 
Range ( " AB2 " ) . Select 
c - 2 

Khile Cells (c, 5) .Formula <> 

If Cells(e« 5) .Formula < 5.74 Then 

Cells (c, 28) .Formula - X 
Elself Cells(c, S) .Formula >- 5.74 And Cells(c, 5) .Forwula < 8,63 Then 

Cells(c» 28). Formula - 2 
Elself Cells(c, 5) .Formula >- e.« And CellB(c, 5) .Foraula < 12.85 Then 

Cell8(c. 28). Formula b 3 
Elself Cell5(c, 5) .Formula >- 12.85 And Cell8(c, 5) .Formula < 17.95 Then 

Cells (c« 26) .Form\ila « 5 
Else Cells (c, 28) .Formula - 6 
End Zf 

C « C 4- 1 

Wend 

Foxmata the nuster display for the spreadsheet and sdjtAfics the column width 
Columns ( "B : AK" ) . Select 
Selection. KufflberFormat « " 0.000000" 
Columns («BrB*) .Select 
selection. KumberFormat m "0.00* 
Columns ("CtC") .Select 
Selection. KurabetrFormat - "0.0000" 
Columns ("EiE") .Select 
Selection. numherFormat - "O.oo" 
Columns ("AD: AD") .Select 
Selection.29urober Format - "0.00" 
Columns ( "A£ : AH" ) . Select - 
Selection. KumberFormat a "0.00" 
Columns ("A: AM") .Select: 
Selection • CntlreColumn . AutoFi t 
Range ("A2") .Select 

Application. ScreenOpda ting - True . ^ j 

Saves the newly formatted spreadsheet as an Excel file to the drive _ 
and directory which you specify. 
ChDrive Oriveljocation 51 . 
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:ell Sheet vl.Od Source Cod 



With ActivtHortoooX 
.Titla • 
.Subject • 

.JUitbor - "Craig Miller" 
•JCtywords - 



Snd With 

Xetiv«Worlcfaoolc.SavaJU FilaAUM : •CSVPile2 , FileFoznact-ixlKormal, _ 
Passwords*" wrlc«RasPaaB«ord3»*", RaadOalyltecomnendEd: ■ 
PAlsa, CraateBaekupiaFalse 
Application. ScreenUpdatiag - False 
GoTo Goods tare 
BadS tarts 

FiXaT^nnition - JWlicatioa-IxsputBoxC** Enter the directory in which you want ai\ files 
to be stored. This must be an existing directory. Exas^le: ZsVMEV\MCM\TEST", "Pile Direeto 
ty, •Ii\A»rV\MCM\SEXTJWT-) 

DriveLocation m Xie£t (FileZ«oeation. 1} 

StatsFlle • Application > inpntBox ( "Enter a name £or the Suamary Statistics File: a file whi 
eh viU eontaia all of the fitatintics for each ease. Sxenple: TESTSTAT", "Sunmary Statistics 
File Heme*) 

Application. soreeaopdating - False 

«or)cbeoJes.Open FilensBei* "C:\CRAZGVKACK05\C5VU5T31.BLK" 
CSiDrive DriveX^ocation 
O&ir riletecetlon 

With ActiveWozkbook 

.Title - 

•Subject ■> "" 

.Author - "Craig Miller" 

.Xeywords ■ *" 



Snd Kith 

AetiveWoricbook.Savexs FilenasMt-StatsFile. FileFoxmati»xlllorvial. _ 
Passwordt*"", WriteftesPassvord:**"^ a eadOn X yReeoameaded i ■ ^ 
False » CreateBaetapt -False 
ACtiveWindow. Close 

MsgBox "Open the ••XLS file which you with to converts" 
XliSFile « Application. OetOpenFilename ("£xeel (*.x1b) , *.xls") 
If XUFile • False Then CoTo FnfiT.ine Else 
Wor)cbooks.Open Filensaet-XLSFile 
GoTo Goods tart 
GoodStarc : 

Caseliuifi m Range ( "AS") .Value 
While CaseMum <> •" 
I Application. Sereenopdating - False 
* Select all cells pertaining to this particular cast 
y - 3 

While Cells (y, 1) - CaseMum 
y • y ♦ 1 

Wend . 
Select the cells for one case, copy them co e blank Cell Sheet tenplate, 
and delete the first and last row (blank rows from teaiplate) . 
Bangs (Cells (2, 1> , Cells (y - 1, 35)). Select 
Selection. copy 

Worlcboolcs.Opcn Filename tv^cACRAZGXKAfStOSNCSVlOBMD.BXX" 
Selection. Insert Shift : -xlDovn 
With Selection. Font 

.Heme - "Times New Roman" 
.FOBtStyle - "Regular" 
.sise m 10 

,Stri3eethrough - False 
.superscript - False 
.Subscript ■ False 
.OutliaeJront - False 
, Shadow • False 
.underline * xlNone 
.Col rzndex • xlAutomatic 
Exvd With 
Kith Selection 

.HerisontalAlignmenc - xlCenter 
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Cell Sbeet vl.Ud Source Cou^ 



.VertiealAlignment - xXBocum 
.HrapTtxc « False 
.Orlentai^lon • xlBorizo&cal 
Bad wich 

Vith Selectioa.BordersCxIZieft) 

.Weight - xlThlA 

. Color Index « xlAutomatic 
End UlUi 

With Sel ection. Borders (xUi^ht) 

.wei9ht « xlThin 

.Colorlndex - xlAutomatie 
End With 

With Selection. Borders (xlTop) 

.Weight - xlThin 

.ColorX&dex « xUkutonacic 
End With 

With Selection . Borders {a^Bottom) 

•Weight - xlThlix 

. Colorlndex » xUUitooatie 
End With 

selection . Border Around Lines tyle : «xlKone 
ColtfiB&s ("XsAM") .sntireColuxBn.AutoFlt 
KOws<«3:3") .Select 
Application. CtttCopyMode - TalMc 
selection .Delete shift rBxlUJp 
y . 3 

While Cells (y« 1) • CaseMum 

y • y 4. 1 
Wend 

Range (Cells (y, 1), Cells (y, 39}). Select 
Selection. Delete Shi£t:«XIT;)p 
Makes the first eoluma the cell nusdser and changes the sheet name. 
BaageCAl-) .Select 
ActiveCell.FoxnulaiaCl « "Cell" 
Bange(»A3*) .Select 
ActiveCell . FonmaleElCl - "1« 
Ilaage ( "A4 " ) * select 
ActiveCell. FomulaBXCl - "2" 

set Source • Range (Cells (3 , 1). Cells (4, 1)) 

Set Fill • ;iange(Cellfl(3, I). Cells (y - 1, D) 
Source .AutoFill Destination: «?ill 
Sheets ("CSVldNMD'} .Select 
CaseKum « Triin(CaseMU2tt) 
Sheets CCSVldNHD'] .Kasie «= CaseHum 
Ilange("Al«) .Select 

saves the separeted spreadsheet as an Excel file to the specified directory. 
ChDir Filelocation 

With Act;iveWor)cbobk 
.Title - 
.Subject m 

.Author - "Craig Killer" 
.Xeyi*ords « 
.Conments «» "* 
End With 

ActiveWorkboolc.SaveAs Pilenane: -CaseKum, FileFormat :-xlNor»al. _ 
Pasfiwordt-"-, WriteBesPass«fesdt-"'; xeadOnlyRecotomendeds. _ 
False, Creace&adcup: -False wn.^,v 
Selects the statistical Calculations, pastes thexn to e blsnk spreadsheet. _ 
end cute and copies all of the values to a single row. 
Range (Cells (y ♦ 2, IJ , Cells (y ♦ 13. 39)). Select 
Sal action • Copv 

Selection. Pastespeeial Past .--xlValues. operation :«xlNone. _ 
SkipBlanks t -False , Transpose s -False 
Selection • Cut 

W rkbooks.Add Tes^late: -"Workbook" 
AetiveSheet . Paste 

With Selection. Borders (xlL ft) 
.Height • xlThin 
.Colorlndex « xlAutomatic 
End With 
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Cell Sheet vl.Od Source Codw 



With SftlftctioQ. Borders (xl&i^t) 
.Height > xXThin 

• Colorladax • xlAutoaacic 
End Wich 

With S«lftctioxi.Borders(xXTop) 
.weight - xlThlA 

• Colorlndex » xlAutomatic 
£ttd With 

With Seleetion. Borders (xIBottom) 

.Weight - xlThin 

-ColorXndex • xlAutomatic 
End With 

Selecrtion.BorderAround X«liiestyXe:KxlMone 

Range ( ) .FoznuXe • CaseNUm 

ftuge(«B2tAM2*) .Select 

Selection . Cue 

Reage(«ANX«) .Select 

AetiveSheet • Paste 

Range ('B3tAK3") .Select 

Selection. Cut 

Range t-BZl") .Select 

AotiveSheet • Paste 

Range C "B4 1 AM4 " ) . Select 

Selection .Cut 

Range ( •DLl* ) . Select 

ActiveSheet . Paste 

Range ("BStAMS"} .Select 

Selection. Cut 

Range ("EX1») .Select 

ActiveSheet . Paste 

Range ( ■FJ* ) .Select 

Selection. Cut 

Range .Select 

ActiveSheet . Paste 

Range CFIO") .Select 

Selection. Cut 

Range ( •CKl- ) . Select 

ActiveSheet . Paste 

Range ( 'FIX*' ). Select 

Selection . Cut 

RangoCCLl*} .Select 

ActiveSheet . Paste 

Ro«(«l:l«) .Select 

Selection. Copy 

' Pastes the statiatics tor the case into the specified sunsnary £ile. 

Ch&rive DriveliOcation 

ChDir FileLocation 

Wor)ctoob)c0.ppen Filename :«Stat6File 

Selection. Insert shift fxlPown 

Application •CutCepyMode - False 

Aetivewerkboek. save 

ActiveWindov. Close 

ActiveKorfcbook»Saved « True 

AetiveWindow. Close 

AetiveWorkbook. Saved n True 

AetiveWiadov. Close 

Selection • Delete Shi£ t : arxlTTp 

Range AS") .Select 

Application. ScreenC^dating « True 
'Continues to loop the macro until all o£ the cases contained in thxs vector ^ 
file have been separated and no data is left in the vector file. 

CaseMum ■ Renge('A2") .Valut 

Wend 

'If there is more than one vector file to separate, opeaa the next fxle 
Act iveWorkbook. Saved - True 
ActiveWindow . Close 

Msgl B "Do you wish to convert another «.CSV file?" 
Stylel - vbYesKo 
Titlel - "Continue?" 

Responsei « MegBoxCMagX, Stylel, Titlex) 
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zr Reaponsel » VtiiSlo Then 
GcTo ExidLina 
doe GoTo opasUJAc 
£ad It 

* &ndfi the macro and refreahes cha aereen 
Lascz^ine: 

Activeworkbook. Saved » Trac 

ActivaWindov . close 

Applic&tlcm.ScraenUpdating « Txve 
Eadlitne; 
£nd Sub 
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A method for predicting the loss of organ confinemem, comprising: 

(a) providing prostate cells from a subject; 

(b) analyzing prognostic parameters in the prostate cells, the prognostic parameters 
including nuclear morphometric descriptors and selected biomarkers; and 

10 

(c) predicting the loss of organ confinement by statistical analysis of the predictive 
parameters. 

15 2. The method according to claim 1, wherein the nuclear morphometric descriptors are 
univariate or multivariate significant nuclear morphometric descriptors. 

3. The method according to claim 1, wherein the nuclear morphometric descriptors are 
20 selected from the groiip consisting of difference entropy, information measure A, information 

measure B, maximal correlation coefficient, angular second moment, coefficient of variation, 
contrast, peak transition probability, correlation, diagonal variance, difference moment, 
diagonal moment, inverse difference moment, second diagonal moment, sum average, 
product moment, sum variance, triangular symmetry, sum entropy, blobness, entropy, 
25 difference variance, and standard deviation. 

4. The method according to claim 1 , wherein the nuclear morphometric descriptors are 
selected from the group consisting of object sum, optical density. 
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object size, object shape, picograms of DNA, angular second moment, contrast, correlation, 
difference moment, inverse difference moment, sum average, sum variance, sum entropy, 
entropy, difference variance, difference entropy, information measure A, information 
measure B, maximal correlation coefficient, coefficient of variation, peak transition 
: 5 probability, diagonal variance, diagonal moment, second diagonal moment, product momeni. 
triangular symmetry, standard deviation, cell classification (1 =Hypodiploid. 2=Diploid, 
3=S-Phase, 5=Tetraploid, 6=Hyperplo)d), perimeter, DNA index, density, average optical 
density, feret X, feret Y, maximum diameter, minimum diameter, elongation, run length, 
and configurable run length. 

10 

5» The method according to claim 1, wherein the biomarkers include Gleason score. 

15 6. The method according to claim 5, wherein the biomarkers further include number of 
positive sextant core biopsies, sum % area tumor involvement, and tumor location. 

7. The method according to claim 6, further including the biomarker PSA antigenicity'. 

20 

8. The method according to claim 7, further including the biomarker RT-PCR mRNA 
levels. 

25 

9. The method according to claim 1, wherein the nuclear morphometric descriptors are 
Markovian nucliear texmre factors. 
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10. The method according to claim 1, wherein the statistical analysis is univariate or 
multivariate analysis. 



5 11. The method according to claim 1 , wherein the statistical analysis is performed bv a 
neural network. 

12. The method according to claim 10/ wherein the statistical analysis is multivariate 
10 analysis. 

13. A method for predicting the loss of organ confinement before radical prostatectomy 
comprising the steps of: 

15 

(a) obtaining cells from a subject; 

(b) analyzing selected predictive parameters in the cells; and 

20 (c) predicting the recurrence of prostate cancer in the cell samples by statistical 

analysis of the predictive parameters. 

14. The method according to claim 13, wherein the predictive parameters are nuclear 
25 morphometric descriptors or selected biomarkers. 

15. The method according to claim 13, wherein the predictive parameters are nuclear 
moiphometric descriptors and selected biomarkers. 
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16. The method according to claim 13, wherein the statistical analysis is multivariate 
statistical analysis. 

17. The method according to claim 13, wherein the predictive parameters are selected 
from the group consisting of Gleason score, nuclear morphometric descriptors, serum PSA, 
number of positive sextant core biopsies, tumor location DNA ploidy, and sum % area of 
nimor involvement. 

18. The method according to claim 13, further including RT-PCR mRNA levels. 



15 19. The method according to claim 17, wherein nuclear morphometric descriptors are 

selected from the group consisting of difference entropy, information measure A, information 
measure B. maximal correlation coefficient, angular second moment, coefficient of variation, 
contrast, peak transition probability, correlation, diagonal variance, difference momem, 
diagonal moment, inverse difference moment, second diagonal moment, sum average, 

20 product moment, sum variance, triangular symmetry, sum entropy, blobness. entropy, 
difference variance, and standard deviation. 



20. The method according to claim 17. wherein nuclear morphometric descriptors are 
25 selected from the group consisting of object sum, optical density, 

object size, object shape, picograms of DNA, angular second moment, contrast, correlation, 
difference moment, inverse difference moment, sum average, sum variance, sum entropy, 
entropy, difference variance, difference entropy, information measure A, information 
measure B, maximal correlation coefficient, coefficient of variation, peak transition 
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probability, diagonal variance, diagonal momeni, second diagonal momeni. product moment, 
triangular symmetry, standard deviation, cell classification (1-Hypodiploid. 2 = Diploid. 
3=5S-Phase. 5=TetrapIoid, 6=Hyperploid). perimeter, DNA index. densit\ . average optical 
density, ferei X, feret Y, maximum diameter, minimum diameter, elongation, run length. 
3 and configurable run length. 



21. The method according to claim 19, wherein the nuclear morphometric descriptors are . 
Markovian nuclear texture factors. 

10 

22. A method of determining the loss of organ confinement comprising: 

(a) providing a neural network; 

15 

(b) training the neural network using a first set of prognostic parameters obuined 
from cells known to lose organ confinement and a second set of prognostic parameters 
obtained from cells known not to lose organ confinement; 

20 

(c) analyzing prognostic parameters in tumor cells of an individual having an 
unknown state of organ confinement; and 

(d) predicting the loss of organ confinement in cells of the indi\ idua] having an 
25 unknown state of organ confinement using the prognostic parameters and the trained 

neural network. 
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23. The method according lo claim 22, wherein the prognostic parameters are number of 
positive sextant core biopsies, sum area of tumor involvement, and quantitative nuclear 
grade. 

24. The method according to claim 22, wherein the prognostic parameters are serum 
PSA, tumor location, number of positive sextant core biopsies, sum area of tumor 
involvement, quantitative nuclear grade, DNA ploidy, and post operative Gleason score. 

10 

25. The method according lo claim 24, wherein the prognostic parameters are selected 
from the quantitative nuclear grade group comprising difference entropy, information 
measure A, information measure B, maximal correlation coefficient, angular second moment, 
coefficient of variation, contrast, peak transition probability, correlation, diagonal variance, 

15 difference moment, diagonal moment, inverse difference moment, second diagonal moment, 
sum average, product moment, sum variance, triangular symmetr>', sum entropy, blobness, 
entropy, difference variance, and standard deviation. 



20 26. The method according to claim 24, said prognostic parameters comprising object sum. 
optical densit}', object size, object shape, picograms of DNA, angular second moment, 
contrast, correlation, difference moment, inverse difference moment, sum average, sum 
variance, sum entropy, entropy, difference variance, difference entropy, information measure 
A, information measure B, maximal correlation coefficient, coefficient of variation, peak 

25 transition probability, diagonal variance, diagonal moment, second diagonal moment, product 
moment, triangular symmetry, standard deviation, cell classification (l=Hypodiploid, 
2=Diploid, 3 = S-Phase, 5=Tetraploid, 6=Hyperploid), perimeter, DNA index, density, 
average optical density, feret X, ferei Y. maximum diameter, minimum diameter, elongation, 
run length, and configurable run length. 
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27. The method according to claim 22, funher including the biomarker RT-PCR mRNA 
levels. 

5 

28. The method according to claim 22, wherein the neural network is of the back 
propagation tj'pe. 

10 29. The method of claim 22. wherein the neural network is of a hybrid type. 
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FIG. 31 
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UroScore Plus PSA: Prediction of 
Organ Confined Disease Status 
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FIG. 34 35/37 

UroScore Minus PSA: Prediction 
of Organ Confined Disease Status 
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Quantitative Nuclear Grade and 
Gleason Score Prediction of Organ 
Confined Disease Status 
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UroScore (Pred, Prob.) 
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